<400> 1

ttgttatatc agtaatggct attgctccaa taacttaata ctaatatata attagtttat 60 

gaataaattt tatatatttg ggttattgtt ttttacacta aatgcatttt cttgctcaga 120 

tcttctagat acagacatga gagttcttga ttccgctgag tcaagaaacc tttgcgagtt 180 

tgaaggaaaa gctttactag ttgtgaatgt tgcaagtaga tgtggttaca cttatcaata 240

tgctggcctt caaaagttat atgaaagtta taaagatgaa gattttctag taattgggat 300

cccatctaga gattttcttc aagaatactc tgatgaaagc gatgttgcag aattttgttc 360 

tacagaatac ggtgttgaat ttcctatgtt ctcaactgct aaagtcaaag gaaaaaaagc 420 
acacccattt tataaaaaac ttattgcaga atcaggtttt actccctcat ggaactttaa 480

taaatactta atctcaaaag agggcaaggt tgtatccaca tatggatcaa aggtaaagcc 540

tgattcaaaa gagcttatat cagctataga aggcttgctg taaaattatt acttagaaac 600

taatacagtt ttaggcttgt ttgctgcaaa tattccatta tctacaactc caggaatatt 660

attaatcaaa gcttccattt cagtggggtt tgaaatatcc atattagaga tatctaaaat 720

gtgattacct tggtctgtta taaatccagt tctatatgtg ggtattccac cgatcgagat 780

tatttttctt gcaacaaggc tcctactttc aggtatcacc tctataggca gtggaaaagc 840

tcccaaaaga ttaaccatct ttgactgatc aactatacat ataaactcgt tagaggcaga 900

agcaactatc ttttctctag tatgtgcgcc accaccacct ttaataagac aattttcagg 960

agacacctca tctgcaccat ctatgtaata agctatatca actacatcat taaggctaaa 1020

gacctctatc ccattttcat ttaataattt tgatgaagca tctgaactag aaacagctcc 1080

agcaaatttg tgcctatgct cctttagttc ttctataaaa aaattaactg ttgagccggt 1140

tccaatacct aaaatcatct caggatgaag attatttttg atatattcta tagcttgttt 1200

agcaacattt atctttgagc cactcataga gttataatac aagaaaatat aggtagttaa 1260

ttattfctgag actaaaaatt aaaaaaacag gttcttttaa gaattcccag aagtacctaa 1320

agcttattcc taatgcatct gtttatgacg tagcattaaa atcacctata acatttgctc 1380

taaatatttc ttcaaagctg gggaataaag ttttcctaaa aagagaggat ctgcaaccta 1440

tattttcttt taaaaacaga ggagcgtata acaagattgt aaatttatcc gatgccgaaa 1500

agaagagggg ggttattgct gcatcagcag gaaatcatgc tcaaggggta gccagtgcat 1560

gtaagaaatt aaaaattaat tgcttgatag ttatgccaat aacaactcca gaaataaaaa 1620

taaaagatgt aaaaagattt ggagccaaaa tactccaaca tggggacaac gtagatgcag 1680

cattaaaaga ggcactgttt attgcaaaga aaaaaaaatt gtcttttgtt catccttttg 1740

acgaccctct aacaattgct ggccaaggga ctataggaca agaaattctt gaagataaaa 1800

ataattttga tgttgtcttt gttccggtgg gaggaggagg tattctagct ggtgtatctg 1860
cctggatagc acagaataat aagaaaataa aaattgttgg tgttgaggtt gaggattccg 1920

cttgtcttgc tgaggccgta aaagctaata aaagagttat tttaaaagaa gtgggcctct 1980

ttgctgatgg ggtggcagta tcaagggttg gaaaaaataa ttttgatgtt attaaagagt 2040

gcgtagatga agtcattaca gttagcgttg atgaggtctg caccgctgta aaagatatct 2100

ttgaagatac aagggttcta tcagaacctg ctggggcatt agcacttgca gggttaaaag 2160

cctacgcaag gaaagttaaa aataaaaaac ttattgctat aagttctggc gctaatgtaa 2220

atttccaaag acttaatttt attgttgagc gatcagagat tggtgaaaat agagaaaaaa 2280

tattaagtat caaaatccca gagatacctg gaagttttct taagctttca aggatgtttg 2340

gcagctctca agttacagag tttaactaca ggaaatctag cttaagcgat gcatatgttt 2400

tagttggtgt tagaactaaa actgaaaaat catttgaaat cttaaagtcc aaattaaaaa 2460

aagcaggctt cacctttagc gactttactc gaaatgaaat atccaatgat catctgaggc 2520

atatggttgg tggcagaaat agtgactcag gctctcataa caatgaaaga atatttaggg 2580

gagagtttcc tgagaagccg ggcgcgctgt taaattttct agagaaattt ggaaataaat 2640

ggmatatttc cttatttcat tacaggaacc taggttcagc ttttggaaag atattaattg 2700

gcatcgagag taaggataaa gacaagctaa taaatcattt aaataagtca ggnactattt 2760

ttacagaaga aacctctaac aaggcataca aagatttttt aaaatgaaag gttaatactt 2820

taatctaaat ttaattgaaa aaagctcatc gctagggttt tcccacggct ctttgaacaa 2880

ctcggattga gatctatcat cctcctcgtc gtaaattctc ccacctttag aatagaccaa 2940

aaatagatat gacaaaggag cgagctcata tttatatcta atttggaacg aagctacgcc 3000

agtattaaat tcattaacta tattatttcc tttataaagg tatccattag catctgaaaa 3060

aatactaata ggattttttg ctttcaaagc aacaaattga ctcttaagtc tgatctcatg 3120

tttattattt ttaaaccaat ttagatcaaa agataaggta tcttgtcttg aatcatatga 3180

ggcaagatta ttattatcct gccatatcag ccattcattt tcctttctta ttctatattg 3240

cgcattgatt cttaagttat catttggaaa tattgaacct gctatcttgt aaaattttct 3300
accataccca ttcgaatccc accgattatc tttctctccc ttaaagaagc taactctcca 3360

gtcatatgtc cagaatgaat agttctttgc ctcaaagtct gctgtaatac ctattcttct 3420 

ttttgatttg ataaaggggt aggcttcatt ttttcttgtg atagttgtat ttttcccaga 3480 

agatctaaag ttaaaatcta attgaaattt agagttgtcc ttaaaactaa aagaattttt 3540 

ttgatcgatg cctattggat ttgaattacc agctgtgtca gcatcataat ttagatcaat 3600 

tccataatct atttgtttta atatcgagct attatcaaat tcatttattt ttcgatttcc 3660 

tccaatacca gcatgaatcc agtctcttct ttgcagataa ccaaagtcat ttaactcgaa 3720

gtcgtcttca aaataaagaa ggcttccact tatgttagat agtttatttg gaagatatgt 3780 

aaactgagtc ctatacccaa gcccattttt accatctttt tctgaggcta acagatctga 3840

atatgtaatt aatttttttg aacgaatatt gatgtaatca ataacattga ctgttgatga 3900 

ttcgcctgtc atttcattct caacattcgt taccatgaag ccaagcgttt tatttccaag 3960 

ctttgttcga gatcgaagag cataatagtc tcttccaact gaaaaggctt catcagcctc 4020

acttgctaca aatactccaa attcattatt attacttttt tgagtaagtc ttaatgcaaa 4080 

atcaatatca gaatagtttt ttttagctgc ctcgcaacct tcttcattac tttcttctga 4140 

gcaattatag ctgggggcag ctccaatcct ccgcgtattt ataaccgagt atctatcata 4200 

attactaata tcaaatagtg attggttttc attgaaaaat gctctttttt ctgagtaaaa 4260 

agtttcttga gcagaaaagt taataaccac atcatcactc tcagcttgtc cgaaatctgg 4320

attaatagct aaatttattt gacgaccttt tccagtgcta taaaagattt cagccccaat 4380 

atctgaccct tcttggtttg taactgaatt tttatttgaa gatatatatg gaaaaaaagt 4440 

aagctttgat tttgtatagt tttgtatttc taagctatct aactcttgaa agtagtcatt 4500 

tctactagcc attgttccgg cactgctaac ccatgactca ttttcggcac tataacgtaa 4560 

tgcggtgtaa ttaatttttc ttatatcacc atcaggctgt ttcattaacg ttacatccca 4620 

aggaataaaa aactcagaga cccaataccc atcaaatttt tgtgtttttg caatccaatc 4680 

tccatcccag tctgtcttaa agtctcctgc ttgcgttttt atggcatcga aaagcgagtt 4740 
cccaagattt atagcaagaa tgaaagcttt gttaccatca ccatcaaagt ctatatttat 4800

agagttttta tcgctaagtg aatttatttg atctctaagc gtcttcctgg agaacataga 4860 

atcattactt tgaaaattct taaatccaac atatatacca tccttatttg agaaaattaa 4920 

agccgttgta agaagttcat ttttcttaag agtaaaagga gatgtttcat aaaaatctgt 4980 

aatttcaaat gcattattcc actcaggctc atcaagagag ccatcaataa caattgagtt 5040 

tgaccaaatc agcacagagg taagtaaaag tgataatgag gctaaagtta ttttcataga 5100 

tagatttaat tcagagtatt ttaatctatg aagagtagga aatcatccga tcattctcag 5160 

aaaaaacata atcatattta aggtcatgtt tttctccaaa atgcatatta caaatctggt 5220 

attgataaca cagtccaacc aataaaggtc ttgatacctc ctcgtttata gagccaataa 5280 

ttctatcgaa atatccagag ccatagccaa gcctgtatcc atttaaatca actcctgtca 5340 

taggaataaa cattaaatca atttcattta tgttgacata atcttcactt ttaacctctt 5400 

tgatcccaaa ttgatttata aagaagttag gctgctcatc cagcaaatta aaagccatca 5460 

tttcgtcatc aattactttt ggaatataaa tattcttttt aagtttagta aaagcttgaa 5520 

ttaacaaatt tgtattgact tcatttcgaa aaggaaaata taaggcaatg ttttgcattt 5580 

catgagtatt gattttttct aatacatttt cttggattaa aaaacttata ttgctcttag 5640 

ataaatcaga aatagactgc ccctgctcaa agagtgattt tcgtatttta tttttcacca 5700 

taaattgtgc cgaggctaac aaaccaaatc accgccatgt gctatacctg aaccgatggg 5760 

tcaggtggtg aatttcataa caaatcaggc ttccctgtaa aggtactgcg caacaatgct 5820 

agaggaatat cacctattta attgtatcgg ttcaaattta ttaacacatt agcgtatgaa 5880 

ccagaatgca gttaattata aaatatatag gtattaagta aaagttaatt tttagagagc 5940 

agactctatt ttttgtatta gcttttcgat atctttatta tttaaagcag agtcagtatt 6000 

ctcggttggc ttggataaaa gcttactagc catagttaga ccagctatca ccaaggcatt 6060 

attcttatca ctgatgccat caagctcatc atttaataat tttgcagctc taatgagttg 6120 

gtctttctct tctggcggac aggctaacgt cagatctcgg ccaaaaattc ttaaagatag 6180 
cgtttccatt tctgacatta ttttttgagt ttagttattt ctttatttag tatgagctta 6240

tcttttttcc agtctctttc cctttgcaca taagaatcta ttattccttt ttgctcatca 6300

aatcttttta tcaaaagatc taccttatcc tctagttcaa ataatgatga ttttttttct 6360

tctgacattg tataagttta attgtactag aatgaattgg aaagtttctt taataaaaga 6420

gtaaaataag gcatggaaaa aataattttt aaaaacagaa gagattcact tataaaacac 6480

ctacctaaga attcagcctt aattgtacct ggtgcagatt tgcaatatag aaatgctgac 6540

tcatcttata atttaagaca ggaaagtagc ttctactatc tgtctggctt ttgtgagccc 6600

tcttctctaa tggttttagt taataatgga aaaagcattg attcaataat ttttgttcct 6660

gaaaaagata aacttaaaga aatctgggat ggttatcgag ctggccctga gggcgcaata 6720

iStgattttc tttttgatca agcttttgaa aataataaat cagatgcttt aatgcctgaa 6780

itccttcaag ggctagaaaa agttttttat tcaataggga agaaaaatgg ctttgatcag 6840

fjlagtaattg actggacatg cgcagcaaat tctaaagata ggcacagcaa atcaattgat 6900

fittattgatg gctcttcgat ggtaggaaat ttaaggctta tcaaagataa gcatgaaatt 6960

4&tattatga agagagcttg tgaaatttca gctgaatcat atattgaggt catgaaatct 7020

fiaaagcctg gggacaatga gcaggaaata gaggcgctat ttttatatga attcgccaaa 7080

#f gggaggaa ggtttccagc ttatacacct atagttgctg gaggtgaggg tgcttgtgta 7140

ttgcattata ttgaaaatga taaagagtta gcttcatcag atttaatttt ggtagacgca 7200

ggatgtgaat acaaaatgta tgcatctgat atcacaagaa ccttcccagt aagtggaaaa 7260

ttttcagatg aacagctaca aatttataat attgtccaca aagccaatct tgctgcaatc 7320

gatgctgtaa aaactggtaa tagcataatg gagccccaaa tggtttcaga aaaagtaatt 7380

actgaaggtc ttgtagagtt gggtattcta tctggcgatg ttaatcagct tcataaaaat 7440

ggtgcattca aggactttta tatgcataag gtgggacatt ggcttggact tgatgttcat 7500

gatgttggtg actacatgga gggagatgag tttatgaagt ttaagccagg gatgataacc 7560

acaatcgagc caggcatcta tatcagtagc gcaatggatg tagatgacaa atggaaaggc 7620
atcggcataa gaatagagga cgacatcctt gtaacagatt caggcaatat taatctaaca 7680

gagaaggtgc catctaatcc tcaagaaata gaatcattga tggcttagac tatggaggtt 7740 

ccaattgtta tttctggcgg agggataata ggtaattaca tttctcttag gcttgaaaaa 7800 

aataatatca aaaccgttat tgtcgaaaaa gctagtagtt tcaaagccct agataagggt 7860 

ataagaacag tcactctcaa tgagcattct atgcaaatgc taaaaaatat tggtatttgc 7920

ccatcaattg ctcaaatcaa cagcatcgac gtattagatg gtgagggtac aggcaaaatt 7980 

caatttctag caaaggacgt aggcagcgaa aacctttcat atgtaaccta tttcaatgaa 8040 

ttacaaaaac taatttctga tccatgtaaa gaaagaacct tatttaataa tgagattgat 8100 

tcagttcaga atcttaatac agaatctgat ccagagatca tgcttaaaga tggcatgacc 8160 

ataaaaacga atctaattgc tggatgtgat ggaagaaatt caaatattgc aaaaattgct 8220

tcacttacaa gcagcttcga tgactactta caaacagctt taacttttgt cgttgatatt 8280

gataatgatt cacatggcaa agctcaccaa gttttttctg aaaaaggaat atttgcactt 8340

atgccactcc cagaaggcaa gggtgagatg aataaatgca cagtggtttg gtcaataaaa 8400

aatcaagttt tgggagatga gcctgtatct gagtttgtaa aaaataacat ttcttttttt 8460

gaatcaaagc ttaatgttag tctcagggtt aagtcagaaa ttttaagttt taaattatcg 8520

aaccatcatt ttgaaaacta tattagcgga cctattgttc ttcttggtga tgctgctcac 8580

tcaattcacc ccttagcagg tcaaggtatt aatctaggat ttgcagatgc agatactttt 8640

tgtgaagagg taattagttc ttataaaaaa gggattgcct ttaatgagaa atcagtttta 8700

aaaagatatg agattagaag aaaaagtatg aactttttaa tgttgaagtc tatggacttt 8760 

tttgtggatt tatttggttc agaaaattta tatctaaggc tgatcagaaa tttaggtatt 8820 

tcttcgttaa ataaatcaaa gtttgttaaa gcattcttta taagacatgc ctctggaatg 8880 

aataagtttt aaatttgtat taaacttttt gacctttagc tctaagttct ttaagaactt 8940 

cactaatgcc ttttttatca atgattctca taccttttgc agatacttta agatttacga 9000 

acctgttctc agattcaacc caaaatttgt gtgtgtgaag attaggaaaa aactttcttt 9060 
tagtcctatt tttagcgtga gaaacattgt ttcctgactg tggtatctta cctgttactt 9120

gacatatttt actcattgaa acgcgatttt atagaacact gaggaactta gcaatactat 9180

tgtgaaacaa atttatttat tacggcatgc acaatctgac tgggagagct ctaatcagaa 9240

agattttgat agaccattag caagaaaagg cattgaagaa gcaaataaaa tatcatgtta 9300

ctgcaaatct cattcaattt tagtagataa aatattctgt agcactgcag aaagaactaa 9360

gcagactttt gatatatgca gtgatgggct taattatcca atagctgaag cagtctatac 9420

tgatgagctt tacttttctg gccctggtga gatcgttaag cttatccaaa gtttaagtga 9480

attcatttcc tctgttttaa taataggcca caatccatca atgcaaatgt acatagatgc 9540

tatttcagaa aatcctcata ttacgtattc aacatgcggg ctggcagaaa ttctcgttga 9600

aagttcatgg aaagacttat ctttaaaaaa atgtaagtta aaatctttta ttcaaccagg 9660

agagctttaa aaattgaaaa acataaaaat taagatcatt aacccactaa tgggatccaa 9720

gataccctta cctcaatatg aaacaaaggg ctcggcagga ttggatttaa gggcatgcct 9780

agatagtaat ctcagccttc aagcaggaac atctcagttg atacctattg gttttgcaat 9840

gtacttagaa gatcccggtc ttgcagcaat ggttatacct agatcaggtt taggttctaa 9900

gcatggaatc gttcttggta atctggttgg gttgattgat tcagactatc aaggagagct 9960

aatggttcct gcctggaata gatcagatac agattttgag attaatcctg gagacaggat 10020

tgcacaaatg attatagttc cagtgattca agcagatttt gaaattgtag acgagttcaa 10080

tgagactcag aggggagaaa agggttttgg aagttcaggt ataaattgat aaatttactt 10140

tttcttgcca aatctttctt caaatttctg aactcttccg ccagtatcaa taattttttg 10200

cttaccggta taaaaaggat gagaagcaga ggatatatca agagggtagt atgggtatgt 10260




t— j~t s~i s~t 4— 4— f* 4~ 4— 


tcguuxgagu 


cgcauccaat 


gttgaacgaa 


tgagaaagaa 


10320


cttatcagca ctagcgtcat ggaataaaac ttcacggtat tcaggatgta tatctttttt 10380

cataataaaa tcaaaatttg gatgagaact ataacaaaaa acaacttctt ttcaatcaaa 10440

acatgaaaat tttttactat gacatagctg tttcattgcc tctaaggcaa tgctttactt 10500
atagttctga gcttaaaatt acaaaaggaa cccgcgtgtc agttcctttt ggtaaaagga 10560

agattgtagg agtggttata aaaaatattc aaaagccaga tttcctaaaa aaagcgggag 10620 

ctataaaaaa aattatcgct gtacttgatg aatatccttt gttcgacaag ccaatttttg 10680 

attctatatt gtggtcttct gattactatc atcatcctat tggtgaggtt tttaatacct 10740 

ttatacccac cgaattaaga aaaattaata ataaaaaaat tgaagcttta agagaatttt 10800 

ctgaatattc agtaaatgag gatgataaga aattcgattt aaccaaggat caagaaaaag 10860 

cagtcaaggc gctttctaaa tctaaaggat tttcacccac tttattatat ggagttacag 10920

ggtctggaaa aacagaagtt tacttaagag ttgccgaaac ttttattaaa aataataagt 10980 

cagtattagt tttagttcca gaaataaatt taacccccca attgctctcg cgatttgaga 11040 

atagatttaa tggtgaaatt ggcatatatc attctaagca aacagcagct aagagattaa 11100 

agacttggct aaaagctaaa tttggttcta taaaaataat agtaggaact cgatcttcgg 11160 

ctttagtgcc tttagataac attggtttaa taattatcga tgaagagcat gaccaatcat 11220 

ttaggcagtc agaagggttt aaattctctg ctagagactt aagtataaaa agggcacagc 11280 

ttgcagatat tccaattatt ttgggatcag caaccccttc gctgcaaact ttaaaacttg 11340 

taaaagaaaa taaatttata agagttgata ttcctaatcg agttgatgga aacaagcctc 11400

ctaaattaat agccttagat atcaataaca gccctttaat aggcggagtt gctaaagaga 11460

caattgaagc aatgcaatca accatagaca gaggagaaca ggttctagtt tttattaata 11520

gacgaggatt cgctccactc tatcaatgta gtagttgtgg gtgggtagca gattgtaaat 11580 

cttgtgatac aaatttagtc ttccaccagg caagaaatag attaatttgt cataggtgtg 11640 

aatctgccta ctctgttaat ttgtcttgtc cggcatgcaa gtctaatgac tttaatatgt 11700 

atggagctgg aacagagaga gttgaagaag ttcttaaaag cagctttgta aagactccaa 11760

taattagagt tgatcatgac tcaacaaaaa aagtgggagc tatggaggct atagttaaaa 11820 

aaattcattc ctcagacgca gcaattttag ttggaactca aatgcttgca aaaggacatg 11880 

attttcctaa agtcacctta agcgttattt taaatgctga taatggcctt ataagcccag 11940 
aaattaatgc attagagaaa atatctcaat tgcttattca ggtctctgga agagcaggaa 12000

gaaataataa tcttgcaaaa gttattattc aaacaagata tcctgatgat ataaatctta 12060

ataaaattaa gacaggagat tatatgaaat ttgcttctca atgcctaagt accaatgagc 12120 

aaatgaactt acctccattt actactttat gtctgcttag gtgctcatca ccaactcaaa 12180 

agagtaatgt agatttccta gagaaagctg ttttaatttt atccaatagg actgatataa 12240 

atgttattgg tcccttgcct tcattagttt cgaagtcgaa aggaaattat aggcaccaag 12300

tctatatcca tgcaccaaag aagacttttt taaataaggt attaaagttt ttgacaacag 12360

agtttgaaaa atggccggaa tctaataagg ttaagtggtc tttcgacatt gatccaatag 12420 

acttaagcta aatattaatc ttaattaatt gtcctgggta tattggttta ttgtttagtt 12480 

tattctctgt attaatttct tctacagtca ccccaaatct tatcgctatt tctgataaga 12540 

catccccttt ttgtattttg taagtcacaa agcctggatc aatactcata aaggtatttg 12600 

ctttaggttt gtccttaaaa tagttatgta ttcctaggaa aacagatctt gcaatcattc 12660 

ttctccctgg cttgcccttt aatctttgtg cgtcttcagg gttggttata aaccctgact 12720 

caaccaatac agaaggtata tcaatagact taagaactct gaaatcagcg tactcaacat 12780 

tctttttatg aatttttgtg aaagggtctc ttttaagttg atccaatatc ttagttccta 12840 

aaattttact ttcagaaatt tttttcttat atatttctgg ataggtttgt cttgccgcat 12900 

cctcatcaaa atcaactggc tttagatttt ttatatcagc ctgtattctt tccctttgtt 12960 

ttttagataa gttcctagca acagtacttg aagcttcatc tgaccatata aaaacagaag 13020

caccctttac ggaagataat ctaaacccat cagcatggat tgaaacaaaa atatctgctc 13080

catatttcct agcatcctgg tatctattat ttaaatctaa tgtctcatct ccatctctaa 13140 

tcatcaccgg tctgtaccca taagtatctc ttaaggttct ttctaactcc ttcgcaataa 13200 

gtaacgtcac atctttttct aaaatattat ttgggccaac cgcaccaggg tatttaccac 13260 

cgtgacccgc atctatagca acaacaatat ctctaatact tttattgagg tttttatttc 13320 

ttttcacctt aagctctaat tttatatttt ctgtatttat agtctgagtt gggttttgcc 13380 
aatggactga ttcatataaa tcaacaacga ttctggtaag acttccatcc tgagatgctc 13440

ttactttttt gattggatag ttgtatggaa catttatctc ggtcggaagg ctcgattgat 13500

taatttccat gacaattcta gagggatttt taaagaataa gaccttacca atgaaacttt 13560

atctagacta aagttaatac taatttcatt gttacccatg tcttgtattt catcgaagaa 13620

aacatcattt ccgctgataa aaaaagatag aaaaccaagg attgccaatc tattcataag 13680

tttttaacca ttctttaaag ccattgtcac cagaaattaa agaaacctct cttccttcag 13740

gaagatggct gaaaattatt tttagatcaa aacttctttc atgttgaagc ctttcaggcc 13800

actcaattag cactactttc ttagagttta tttttctact tagatcaaat atatcaatat 13860

cttctgcttc gttagttcta taaagatcaa tatgcaaaaa cattaaatta ttaaaatcat 13920

attcctcgca gagagtgtaa gttggacttt ttacaagatc cttccatcca caatttttta 13980

taatagatct agatataaag gttttgcctg ctcctaagtc tccctcaaga tgtatttcaa 14040

tttcttgcga agaagattta agtatctcca tagctatttt tgaccctagt tggtttgtag 14100

cctcgtcatt tataagagta agttttttca tcgatttatt aattgtctta ttataggtat 14160

taaactagag gcatttaagc ctatttcacc aatatctacc ttaaatttta aacctgcctc 14220

cgaatgaaca gcaacagcaa taatgctcgc atttcttata tcaagacctt gggcaagaag 14280

agcagtcaaa acaccagcca gaacgtcacc agttcctcct gaagcaagtt caggtccgcc 14340

gcacgcgcat ataaaagact tgttatttgt tgagtcaaag accaccgttc ccattccttt 14400

taatatgaca atagaagccc cataagtgtc tgcaatttgc ttggcagcag aaatcctatc 14460

tctttgaact tcttcaatag atatgtttag taatatagcg gcctctcctg gatggggcgt 14520

catcaagatt gttttattgg attttttgat taaagatttt gatgatgcca cgatgtgtaa 14580

agcacctgca tctaatataa tttttgaatt atttgcagac tttaatattt tacccaatat 14640

agtttttgca aaagcagtat tagcaattcc tggtccatac aatataacgc tatgatcttt 14700

aatttttaaa tctatatcga ccccttcaac catatcaaca ccaattgcca ttacctcagg 14760

gtttcttaat aaagatgggc ttacattaga cgtgtctgtg acgagagtta ctagtccaga 14820
gccacaaaat aaacttgcct cagatgcaag tattccggcg ccacccatgc ctggggagcc 14880

agcgcatatc aaaacttttc caaaattacc tttatggcta tcctttgctc tattgggcag 14940 

aagatttttt aattcttgaa aggtaaaact ttgcaatata gacatgaaat tacttaagct 15000 

aaattatgtt gaatgctaaa aattcattct tcagcgaaat aaccaatcct atcggaatgg 15060 

aatgatccca taaaaatttt ttcgtcatga ggcactgcaa tagtgggtag gccaaaaact 15120 

gttctagaaa atgaaaactt atcttctctc ttaagtgttt tttgattaag tttatgaatg 15180 

gagaagggta gagaacaatt tgttttttct gcacaattac caaaatcatt agcctgtaaa 15240 

tctaaagaag ttaaccaaac cgaaccgtct tttaaaaaga tattatctgg actctgaata 15300 

aaataactac ctgttttagt attttgatta atatcgtaaa cagataaatt gtcaccttga 15360 

ttatagttaa cataaagaag tccagagccc tcatctagca aaattccatt aggcccactt 15420

ccatctgttc catcaacttt attaaaatta ttgtcactcc aaagtactac atgccctgaa 15480

atacttttaa ataatgaatt cattaaccat tctttcatgc tgatatctct tttatacata 15540

tgagatgcat aaaaacttcc atctttttta agggcaacat cattgaaata atattggtca 15600

gggaccctaa tacagccccg ccatatcata tcccatgaag attcattttt tataatttca 15660

aacatttcaa tcgactcaaa tggcgaatga ttaattacag ctagttgata gaagccttca 15720

tcattttcaa caagatctat tccgtgggga ttgaatatgt ctagctcgcc tcttatgcat 15780

gaagcgtcac cccaagagct ttctccaaaa gtaatttttg gtaccttttt ttcataggta 15840 

tttaaatcca ttaaggcaaa gtaaccaggg gtatgttctg catatggacc gatccctcca 15900 

aattcagaaa taaagaagaa tttattatca ggcgtaatca caatatcttc tgggtttgag 15960 

aaattacaaa taaccttgat acggtcatcg gattcacact tactaatatc catttggggg 16020 

cctatatagt cagtagatac gacagttaca gaaataaata aataaagaat agaaaccggg 16080 

acggtaatct tataatagtg ctttataaaa atttctaaaa tctttgaagc atgatttgga 16140 

agtgcgatca gccaaacccc ttttaaaaaa gatagagccc ccataacaac aaaaattact 16200 

gaccatagcc tgtcagacca ttccggctga agaagaccag ttaaaaaaag aatcatccca 16260 
aagaataaag ctaaatagcc tgagattttg actttcgatc caacaaattt cgaaaagaga 16320
tatgtataaa gaggctttat aagaactagt aaaccacagg ccaagaaaaa gaatgctaag 16380 
tagtaattca taagttagtt tttatataaa tgctccttaa taatactaac aagttctaag 16440 
ggcttgtcca atggaacatg gtgagcagct ccaggaaccc cctcaaaagt cataatgtca 16500 
ccatatgtat ttttaatatt gtccaagata cttccggagg ttaataagct gtcttcaccg 16560 
tggatgaaca aagcagggca gccaaatgaa aatgtgtaac cgaataacct ttcaagactg 16620
ctaaacatga catcatcaaa tttccatctc cacccagcct caatattttt tactgagtgc 16680 
tcagcaatgt atcttaagta ccaatcattc gtacaatctt gcttaggcat taacctaaac 16740 
cttttaataa tatctgtctt gtcttgatag tgcttgatca ttctgagagg agaagagtgt 16800
tgattcgggt cataatccgg tggtcttata aatgtatcaa taataataat attatttatt 16860
ggagatcctttc tttcagatgc aacgtaacca gcaacatgtc cgccgaggga gtgtccaaca 16920 
ataaaaatat ttgaaatatt ttttttatcc ttttcctttt caattacaga gacaatacat 16980
tctccaaaat ctttaatgcc atatgaatct ctaaaagaag agtcacccat gccaggaaga 17040
tctatfcgcaa ctatatttgc gcagtctcta aagtggggcg caataggatc ccaccatttt 17100
ttatgagcac ctgttccgtg aataagaatt attaaatctt tgctttcatc tttggagttc 17160
cagctagaat aggatatatc cccatgagga ttcttgataa tctctgagct aggcttgtcc 17220
tcaatggcat ctttgaacca ctgtggggca tgaataatgt cttgatttag attgttagtt 17280
atttccataa acagtattct aagctataaa aaataaaaat atgaataaac ttaatttaac 17340 
gccagcagca actgttttag tcctaaagga ttctcctgat gggatggaag ttttgatggt 17400 
aaaaagatca agtaggcctc ccttcggaga cctttttgtt ttcccgggcg gcaagattga 17460 
cgaaggtgat ttcaataata agatagaaga tttttgtgag ggcgtgactg ataaagaggc 17520 
ctccataaat cttggattag attctggagg tctagcatat tgggttgcat gtattagaga 17580 
atgctttgag gaggttggaa ttttacttgc taaaaaaaag agtggggaag atcttgatct 17640 
agatggagtc gataaacata aatatcaaaa atatagagag atgttgttaa ataatgaaat 17700
tgatttatat aaaatctgtt tagaagaaaa tttaattcta atgcctcaac aaatagcccc 17760

tttctcgcat tggataaccc ctgaaataga aactaggaga tttgatacac gtttttttat 17820 

tgcccacctc cccaagcatc agaccggaga acatgatggt agtgagctca tagacagtgt 17880 

ttggatttca ccaaaagaag cgctcaaaaa atctcgttcg ggtgagatgc ctatgattat 17940 

gcctacaata aaaaatttgg aacaatgtgc acaatttgat tcgggctcta agcttttaga 18000 

aaatcagagg aatctctcaa atgaggatat cccaccaatc ctgccaaagt tttttaaaga 18060 

agatggtgag tggaggggtc tattgcctgg agataaaggg tatgaggatc attaaataat 18120

atggacttaa ttactaaaat aacagctccc aaccctggtg ttttcactgg gggtgggact 18180 

aatacttatt tgattggcaa agatgatata acccttgtcg accctggtcc aaatatatct 18240 

gagcatctag atgaaattat caaagcaggg gatgggaaaa taaaaagaat ctttgttact 18300

catacccata cagatcattc cccagccgca ttgcctttat caaaaactct taatgttcca 18360

atgtacggaa ggctagtaga tggtgaatcc tcatgggagg atgaaacatt tatcccagat 18420

attattttaa atgataaaga tattattgag acagacgaat atacgttaga agtaatacat 18480

actcctgggc acgcatctaa tcatttatgc tttttaataa aagatacgaa atgccttcta 18540

acaggcgatc acattatgga cgggtctacg gttgttattg ggccaccaga tggcaatatg 18600

acaagctata tcaattcatt agaaaagtta ctagattttg atattgattg ctttgcgcct 18660

gggcatggaa attatattca tgagcctgag aaaaccattc aatcaattat taggcacaga 18720

ctaacaagag aaagaaaagc tcttagaaag ctaggagagg caggaatctc atcattagat 18780 

aaacttacta agcttgttta tgatgatgtg tcagagatgc tccatcctat agctaaatat 18840 

agtctagaag cacatttatt aaagcttata gatgaaaaga aagttaaatt agataaagat 18900 

ctattcgaaa taatttaatc ctttttattt ttatgtaaga ctttctcctc aatagcttct 18960 

atatcaatat catcaattga gtcttcgtta ttatcaggta tcttttttac atctttttca 19020 

attttaaggt cgataggaga ggctccaaga tcaaaagtta gctcccttac attttttgaa 19080 

atagtatcct caacacagtc atcttgatcg tatgcctcac gggtctctcc tttatcatta 19140 
atgggaaagg gtctttgggg agggcccatt tgcatgcatt taatagtagc aactggtgag 19200

tagtaatcgt cactagaata taacttatca agttcttttg gtgatattga gcaaccaata 19260 

attccaaata gaataaatgg cgctaagagt cttttcataa actttgtttg ttttctataa 19320 

gagactcaac aaccgaagga tcggcaaggg ttgtcgtgtc tcctaaatta gatagatcat 19380 

tctcagcaat ctttctaaga attctgcgca taatttttcc cgatcttgtt ttaggcaagc 19440 

ctggagcatt ttgaattaaa tctggttttg caatagctcc aatttcttta gcaacaaatt 19500 

gtttcaattc ataactaaag ttgtcatcaa atgattcatt tatcattaaa gtaacaaaag 19560 

catatattcc ttgcccctta attggatgat caaaaccaac aacagctgct tcagcaattt 19620 

tagggtggag cacaagagca ctttcaattt cagctgtacc tagtctgtgg ccagaaacat 19680 

taagaacatc atcaactctt ccggttatcc agaagtatcc atcctcgtct cgcctggctc 19740

gcatcaccagt aaagtaaata tctttataca taccaaaata ggtgtcgatc attctttggt 19800

gatcaccata aatacttcta atttgactag gccaagattg ctcaataact aaattacctg 19860

cattagagcc ttctagcgta tttccatgct cgtcatagag agatggctta actccgaaga 19920

agggcagagt tgctgaccca ggttttgttg gagtaatacc cgctattgga gagataagta 19980

cagaaccagt ttcggtttgc caccaagtat caataacctc gcaattagat ttaccgacaa 20040

cactgtagta ccaatcccac gcctctggat taattggctc tccaactgta cctaaaattc 20100

ttaggctatc tctttttgtc tttttaacag gatcatcgcc ttgggccatc agagctctaa 20160

tggcagttgg agctgtataa aaaatactaa tgtcatgctt atcgcatatc tcccaacacc 20220

ttgatgctga agggtaggtt ggcactcctt caaacataag tgttgttgct ccatttgaaa 20280 

gaggtccgta caagatataa gtatgtcctg ttatccatcc cacatctgca gtacaccagt 20340 

atttgtcctc tggccttatt ccaaaaagat atttgaaact aatatgagcg cctaatagat 20400 

aacctgcagt agtatgtaga acaccctttg gcttgcctgt agagcctgat gtatagagaa 20460 

tgaaaagagg gtcttcggaa tccataggct ctggagcaca cttattagaa acatctttaa 20520 

caagatcttc ataccaaaca tcttttttat catcccaatt aatttcgcca ccagttcttt 20580 
ttataaccag tgtattttta acatctggac agcccagaag agcctcatct acattagatt 20640
taagtggcac ttttttgccg ccccttaaac cttcatcagc agttataaca attttacaat 20700 
cagcatcaag aattctatct ttgagtgatt ctggagaaaa gccaccaaag acaacagagt 20760 
gcacggcacc tattcgcgtg caagcaagca ttgcgaacgc agtctcaatg atcataggca 20820 
tataaataca aactctcgag cccttttgaa cacccaggtc ttttaaaaca ttagcgaact 20880 
tacatacttc gtcatggagc tctttgtagg tcaattcttt agaatcagca gggtcatctc 20940 
cttcccatat taatgctatt ttgtttggat cattttctaa atgcctatcg atgcagttta 21000 
agctaatatt tgttttacca ccctcaaacc acttcgcatt attaaattga ttattgaatg 21060 
ttgttttgaa gtcttccatc cagcttatgt tttcgtttgc tagattttta aaaaatttag 21120 
aaggatcttc tatggattgc ttgtaaagtt ctttgtattc atcaaagtct tttatataag 21180
gattacttga atgttttggg ctataaagct ttcgaggcat tcttaaataa ttgaaggttg 21240
ggggttaatg aaattctttc ctttgggatt ggacattatt tttgtaatga gcgattcgta 21300
atcgctatca ttggtttcta aattaatatc tgcagcatta attattaata aaggggcaga 21360
actatagtct aagaaaaacc ttgagtatgc atcattcagt ctttccaggt agtcaagagt 21420
tagatattgt tcgttaatat ttcctctctt agtaatcctg tcttttaaca catcaatagg 21480
tgcctgaaga tagattacta ggtcgggtgt tggcgcgtcc agggttagat ggtcatatac 21540
tttgtcatat agatccattt cctcgttaga aagagtaacc tcagcgaata atcgatcttt 21600
ttctattaaa aaatcagcaa ccctcactgt ttcaaaaagg cttctttgtt taagatcttg 21660 
aatttgttgc attctttgaa acaagaagaa aagctgagtg gctagagctg attggcttgg 21720 
gtttttataa aaattcttta agaagggatt ctctgccggt tgttctaaaa aagaatcata 21780 
attaaatgtt tcggctatct tatttgctaa agtagttttc ccaacaccta tcggtccttc 21840 
aattgcgata tattttggaa gtggtacttc tttaatagct gggttcattt ataacgtata 21900 
gtttttatcc tagattatct caactcgtta tagaagccaa atatttttgc agattcttca 21960 
tcactttttt cattcatctc agcaataatg gttcctctca atgagtcatc atttttttgg 22020 
aaatcagtcc accacttccc aataccactc ttgtctccag aagctctctc tcttacaagg 22080

agaaaatcat atgcagctct aaaccttgga tgtcgaagag ttttatatgg ctggctacca 22140 

attctgctat gaagttttaa ttgaagtacc caaatatcct taatatagct tgaaaatttt 22200 

cttggtattg ctgtgatttt ttgttgttcg cgaagtacac catccataga tcggaaaaat 22260 

tttctcacat taatttctcc attcttagaa cactttttca ataacgaggg ccataacaat 22320 

gcagccataa gaaagcctgg tgtaattgac tgttggtttt taaccctatc atcagtattt 22380 

cttagagcat gtgtcataac attgctggca aagtcattcc tacttggatc agaaaggata 22440 

agatatttat ttaaatgaaa tgagcagagt ttttcaaaat tcttttcacc cattccattc 22500 

aaaaatattt tgcagaactc atcaaacaat ctagcgtttg agatgcctga taaaagatgg 22560 

cctttatcat agatggcatc cttaaccaga ttatctatct taaaattgag tttgttacta 22620 

aatcttatag ctcttaagct tctcactgga tcttcttcga atcgtctttg gggatctcca 22680 

atagatacaa taaccttctt gtgtatatgc ttaagcccat cgttatgatc ttctattttt 22740 

tttgtaacag gacagtagta aagggcattc acagtaaaat ctcttctatg acaatcttgt 22800

tcaagagtgc cccaattatt atctctaaga atctttcctg tcgaatcagt aacaatattt 22860

tcaccatctt cctgatcact tcctgatctg aaagttgcta cttcaagcaa ttcactccta 22920

ttaaaaacat gaaccaattt aaatcttttg ccaattattc ttgaagcttt aaatgttttc 22980

ctgatctgct caggcgtggc attagtggct atatcgaaat cttttggctc tagtcccgtt 23040

agtgcatcgc gaacgcaccc accaaccaga taggcctgaa aattattttt ttgaaggtct 23100 

tgaacgacag atatagcaaa tttacttatc ttattattat ctatcaaatc taattatgaa 23160 

tttttatgat ttaagggcat tagccattta gctgcttttc tttaatttcg tcaagtgttt 23220 

tgcaatgaat gcaatgagtt gcagttggtc tagcttcaag cctcttaatt ccaatttcat 23280 

caccacagga ttcgcaccaa ccataatcat cttgcttaat ttgttcaata gatagaccta 23340 

ttttgctgat aagttttctc tctctatctc tggttcttaa ttcaaaagca aactcttcct 23400 

cttgagaagc cctgtctact gggtctgcat aggtttcacc tttagctcta agatgatcaa 23460 
aagttttttg catttcatcc ttaagatgtt ctttccaaag aagaagaacg gcaacaaaat 23520

gtttcttcat tgctgcactc atatattttt cacccttctt agatttataa ggtgcaattt 23580 

tagatttatt attagctatt gttgcttttg cagacttctt ggctgcaact tttttaactg 23640

gagctttttt ggtaacagtt tttttaactg tagctttttt ggcaacagtt tttttaactg 23700 

gagctttttt tactgtcttg gatttttttt cgaccatgta agattttata gaattttagg 23760 

gtggagaaaa tatcagatac tgacaaaatt agctagtcat tttttaattt atttaatact 23820 

ttcaagtacc catccgagct aagtcttggc ccaaatgtct caacaacctt ggaagatgca 23880 

tagtttgcaa acttagcaca tgcttcaata ttattccctt gaaggtaggc atgcataaac 23940 

gatccggcaa acatatcacc agctccattg gtatctattg gagttatttc ttctgcttga 24000 

gcatgcttct caaccccttt atctataaca acacttccat cggcgccttt tgtaatagca 24060

ggtcatatagg gcttttcttt ataaaagcta acagcatcat caaggctttc tttaccagaa 24120

aaagcaacag cttcatcatc attacagaag atcatatcta ttccatatga ctctattaaa 24180

tcaaattttt ctttaaaacc atgaacaata cctgcatcag aaagagacaa ggctttcttt 24240

acgtccttgt ctttaaggtg ctctaagact gaaataacag cattaaagtt atcgtcactt 24300

gttaccatgt agccttcgat ataaaaaatt tttgaatttt ctacaacatc aaaatctata 24360

tctgatttac caagatacgc actaactcca agcatgctgc tcatagttct cttagcgtca 24420

ggagtaacta aaattaagca tttcccagtt ggttgatctg tattttcaga gctgacacca 24480

atatgtttga ctccagccga cctgagacta tcaagatagt ttcttccatc ttcatcatca 24540 

gaaactctgc atacatgatg gcaattcgaa ccataatttg ctgcagcaac aagagaattg 24600 

gttgcagagc caccgcaatc agaaatcgat tcggctccca tttcaataag tttgctaatt 24660 

ataggtgcct gttcttcaga agatgaaaga gtcatagagt cggctacaag gcctacactt 24720 

gataaaaatt catggctgac tttatattga gtatctacta aagcatttcc aagggcgcta 24780 

atatcatatt tcatgtgtta ttcctttttg tattatttgt tttactgttt ctaacgttct 24840 

attaatttct tgatccttat gcattgctga aatgaagcct gcttcatatt tagagggggc 24900

aaagtatatt ccacttctaa tacatgaatt taaaaaattt gaaaatagta catcatcagt 24960

ttttgcaaca tcattaatat tattagggag ttcttctgaa aaaaagaatc cgaacattcc 25020 

accaattctg tttatagaaa acggaatacc tgactcaatc attaaggttt tcatcccatc 25080 

caaaagcaca gatgcatttc tttctaactc tttaaacgga ttttctttaa tcaacaattg 25140 

caacaaagca gttcctccag ccatagctag tggattgcca gacaaagttc ctgcttgata 25200 

gacaggacca gaaggagcta gatagttcat aatttcttct ttgcctccaa aagctccaac 25260 

aggaagaccg ccacctatta ctttccccaa agcagttaag tcaggagtaa tattataaat 25320 

ttcttgagct ccgcctagcg aaactctaaa gccgctcatg acctcatcaa atattaaaat 25380 

agaattattg gctgaggtgg tttccctcaa taactttaag aaatcttcat gacctggaac 25440 

aaagcccata ttccctgcta ctggttcgac tattacggcc gctaagtcat cttttatctc 25500

attaaatatt tctaaaaact gttctttatt attgtattcg caactaaatg tatattttgc 25560

caaatctgca ggaacccctg gagagtcagg taagccaaag gtggcaaccc cagaacccgc 25620

cttaattaaa agagagtcaa catgaccgtg ataacagcca tcaaatttaa taattttatt 25680

ccttcctgta aaacctctgg ccaatctaat cgttgtcatg gttgcttcag tacctgaatt 25740

aaccattctt attttttcaa ttgaaggaat gcatttttta attagcctgg ccacatcaga 25800

ttcaagactt gtcggggcgc cataactagt tccaagcgca acttgatttt taattgcacc 25860

tacaatgtct gggtgtgaat gacccattat catcggaccc caagacccaa tataatcaat 25920

atattcatta tgatcagcat cataaaggta tgcgccggat gctctttcaa aaaatatagg 25980

attgccattg atatttttga atgcccttac tggtgaattt acccctccag gcataagagt 26040

cttggcctct ttgaataagg ctatggattt atcaatttta ttatgtgtca aagctaattc 26100 

ttcttaattt ttaatttcga tatgatatca acttaattct tattatgtat ttaattgttt 26160 

tgaaaattca tccacagtat tccagttggt gaactcataa gtatttgtaa catctgtagg 26220 

accctttgta atccacataa tcaatctaat catgtgttta tctataaatt tatattttgg 26280 

ataatctatt ttgcctgcaa agactgctaa tttcattggg ttccaaggag acagctctaa 26340

aaatttttgc atatacgggt ttgtttcagg tgtatttttt tcaggctttc tcgcaactac 26400

attaactgaa aagaaggcat tttcttttgt ttcaagacaa gcaacatttt tttgaataaa 26460 

ttcataaagt tctggtttgt gtttgccata cctaatgctc gcaccaataa taattttatc 26520

aaattgatat aaatctaact ctactgcttt tgcaatatgt attatttttg aagattcaga 26580 

cacatctaat ttagagaaaa tttttttaca aatttccaga gtttgcccat cggttgtaga 26640 

gtagattagt agagttgatt tcataaatat attttatatt gttaattaaa aaatttcatg 26700 

gcacgcgcaa aatataaaca aaaatatagt atatttaaaa aatgattaaa tatttattta 26760 

ttatcgcttt attcatcagc aactttgcat atgcaaatat tgatgcagct aggtgcgcag 26820 

ggattagttc agatcaagaa agacttgatt gctatgattt aatttttaaa gctaatgatg 26880

gagctaccttt agatagtaat attaaaactc tcatcacacc agctatcaag gctgtgaccc 26940

cagctgattc aataaaaatt gaaaataaag caacaaaaga aaaggatttc gggcttccaa 27000

aaacaaaaat caaaaactct gcaaagaact caataaaaac ctcggtagta aggattaaaa 27060

aaacaaaaag tggtaaatta atttttactt tagaaaatga gcaagaatgg actgctgaaa 27120

cttcttatag agcaaggaat atgtttaaac cagaaaccgc agtcatttta gaagaggccc 27180

tggttagtgg tttttatatg attaatataa gtaataaaca gaaaattaga ataaagaggt 27240

tgaaatagca atgaccatac agagtatagc aacgaccgag ggtgccatga aaagaataag 27300

ctctgttctg gggtcccaag atggctcaag cttcagggtt tatgttaccg gaggtgggtg 27360

ctctgggttt caatatggct tcaagtttga taacgatatt gcattcgatg atgatgttat 27420 

aaattgtgga gatttctcgc ttttaataga ctctatgtcg tacccctatc tttatggatc 27480 

aactctggat tttgttgaag atctctcagg ggctaaattc gttattaaaa acccaaatgc 27540 

caaaacaaca tgcgggtgtg gagagtcatt tacagtttag attttgtaat tgaacccaaa 27600 

agtcctttta cattcttgct tgatgtctga actatcagag cctcattact gactctttta 27660 

tatcccatcc aggcaaatgc catggactca atagcaaaaa catcatggcc taaatcactt 27720 

gagaggacga tatcattact agccatctca gaaattcttt ttaccagata cttgttatgg 27780

gcaccaccgc cacaaatgac aatgtcacaa ttattatgac catttttatg gattgaattt 27840

attatagatt ttgctgaaaa ttcaaccagg gtacatagaa tatcttcagc tttctttttt 27900

aagaatttct tggataatat ttttatatta aagagctctt tgcctgttga ttttggacat 27960

tttcttttaa aaaaattatt ttgaagcaat cttcctaact caatgtgatc tacttttcct 28020

ttggctgcaa ttgcaccatt tttatcaaaa ggaatctgaa gaaaatcact acaatatgca 28080

tctaaaattg cattaccagg gcctacatca gttccccata tgtcatttct attttttaca 28140

aatgagtaat ttgaaatccc ccctatattt aaaataattc gtgggtttct agctttataa 28200

aataattgat tatggaattc cggaacaaga ggggcgcctt cgccacccag agcaatatgc 28260

atatttctaa aatcacttac caccagaagg cccgtttctt ttgcaacaat atttggatca 28320

ccaatttgca tagaaaatgg aaatctctta ttaatttcat gcctaatggt ttgtccagaa 28380

atagcaacac actctataga tgatttctta attttactaa atccaatcgc ctcattaata 28440

gattttgaaa ataagaatcc tatctcttta ttgatagtgc ctaaatctga caacgagctt 28500

tcgttatttt caataagttt ttttactttt aatcttaggg attttgggaa tttaattgaa 28560

tggaaatact caagatgtat ttttgtacca atgctaagaa atgaaatatc aatagcgtca 28620

cgactagttc ctgtcattgc tccaatatag atctttttac tcattactac ttggagaaag 28680

tttattgaac tcctccattg ataatttatt attctttaat agtgtatcaa aacttgccct 28740

iaaatttttt ggtactggtt cagctgatgg taattttact tttataggat ctattctttt 28800

cttacctatc ttaaactcat aatgtaaatg aggccctgtt gccagtcctg aactcccaac 28860

gaaaccaata gtatcgccct gagaaacttt tttgcctttt cttataccct tactaaattt 28920

ttctaaatga caatatctag tcgaatactc atttgtatga tttataacta tctcattacc 28980

acatccattc ctttgaccag aaaaagaaac aatcccatcg cctgtagttc tcaccggcga 29040

tcctcttttt gctgcataat cgactccatt gtgagctctt attgtatgaa gaacaggatg 29100

cattctgttt ggattaaaat gagaactaat atatgcaaaa tctaaagggg ctcttaaaaa 29160

ggccttttgc atattatttc cattttcatc aaagtactgt tttttactgg cttcagtgaa 29220

aaacctattt gcaaaatagg tattaccatt gttaataaac ttagcaatta cgatatcgcc 29280

attttttact ttctctccat cactataagg agtgtcataa attacatgaa attcgtcacc 29340 

ctctctaata tcaaaaacga agtctacatc ccagccaaag atataagcaa agtccataat 29400 

cacactttcg ggaatatttg catctagagc tgcctcataa aatgaggatt gaataatccc 29460 

gctattgtaa gattcaatta actcaatact tttgcttata ttcttaaata ctatttctgg 29520 

agtaaggcta attgaaatag aatttatttg atctttcatg atctcaattc tggttagctc 29580 

ctcaccagaa tattcaaaga gcattttttc ccctggttta atgttagcaa tgatattttt 29640 

ggagtctaat ctaaaaattt tataagcagt atttaaaggc actgaaaagt tttcaaaaat 29700 

tattgaaaga ttttctccat cttgcacctc atgcatctga tatgttttcg tttgctcgag 29760 

aggtatagag aatttttcag taatctcaat ttcttcaact ggcaaagatt catatgtttc 29820 

tatatcaata taaagcatta atattaagac aatagaaaca gcaaaagaaa caagtaccgc 29880

tctttttgga acttttttaa aacctatcat tagtcctctt gaatatttaa gagttcagtg 29940

ttgccaccaa atgctaccga gttctttgag acaacctttt cgttaagaaa ttgcagaagg 30000

taattaggcc caccagcctt ataaccgctg ccggatagat tttgtccacc aaaaggctga 30060

gaaccaacaa cagctcccac catatctcta tttatgtaaa tatttccaac attacacttg 30120

Jcactaaata tgtcggccct tttttctact ctagtatgaa tacccattgt aaggccgaag 30180

^cactatcat taatattttc aattagtgca tccatttcat ttgatttaaa tctaacgata 30240

tgaagtattg gtccaaattg ctcgtctttt agatcagaaa tattatcaat ctcaattatt 30300

gttggagcta caaaattttc atctacatta tcagcagacc taaaaataga ataattctta 30360

tcttcaaaac ctttgacgta agcattgaga ctatcaagag atgttttgct aattattgga 30420 

ccgatatctg tgtcaagatt ttgagggttt ccaattttta gttctttcat accaccctta 30480 

atcattgata aaagatcgtc atatatttcg tcttgtacac agagaactct caaggcagag 30540 

catcgctgtc cagagctatc aaatgctgaa cgaataatat catcggttgc ttgctcgaga 30600 

agtgcgcttg aatcaacaat cattgaattt atgcctcctg tctctgcaat caaaggaata 30660

attgattcat ggttacttgc aaggctgctt tgaatttttt ttgctgtttt taaagacccg 30720

gtgaatgcaa cccctttgat gttattaacc tttgaaagca tgtcaccatg aatgccgtct 30780

cctaggatta aatttaaagc atcttttggg accccaatct catgaaattt gttcacaatg 30840

atgtagccaa gaattgaagt atgttctgag ggcttaactg ttactttatt tccacatgca 30900

agggctgcac ttatttgccc tataagtatt gcaacaggaa agttccatgg actaatacat 30960

aaaatatggc cttttggtga gtaagatagg gcattgatct cgccggtggg accctcaaga 31020

atatggtctt ctgtttggag gccaacagct tgttttgcat agtatctgag aaaatctata 31080

gcttctctaa tttcatcaat agtattttgt actgtttttc cggcttcatt cataaggtaa 31140

taaattagct cagaaggatt tgcttcaatg tcgtctgcaa ttttttctaa aatggatgct 31200

cttttttcaa catgcatcaa agaccattca ctaatatttt gttcttcaag gctgcccttt 31260

atataatcta gatcatcata tgaggctgta ccaatatttc ttccatctgc aagagatgag 31320

atgtcatgag tattggtttt cttataatcc tttgctttat aaatcgatga agcctttatt 31380

tcttttgaat caaactttcc aagctcctct tcaagcattt ctaaattcac cctttcactt 31440

aaatcaaatc ctttagagtt cggtctgtca ttaaatatat ttcttggcat tggtatctct 31500

tttttttcat cctcaatttt taaatggggc cctctggcaa gccaagcaga atctgtttca 31560

ggatcaagta acctattaat aaatgagctg tttgctccat tttctaatag tcttcttaca 31620

agatagggta gcaagtcttt atatttgcca atcggcgcat aaattgaggt atttttttca 31680

gtatttaaga ttttgtttgc ggacttataa agcagctctc ccattccaaa tagtctttga 31740

aattcataat ccttatttga gccaagatga tggattgcag aaatggtatg tgcattatgt 31800

gtagcaaatt tagggtaaat tttttccaca ttaaaaattc tttttgcaca ggctaaataa 31860

gctaagtctg taacagattt ttttgtatag acaggataac catcatagcc atagatttga 31920

gcatgcttaa tttcataatc ccaatatgct ccttttacga gccgaacatg cataggtgct 31980

ctgttttcta gtagctcttc taaccaatct attgtggcta tagctctttt gccatatgcc 32040

tgaacagcaa taccaaatcc tttccagttt ttaatatttg gtgaaagggc catctcttta 32100

ataatctcta aactaacggc cagtctgtct tgctcctcag catcgattgt aatttctaca 32160

tctttagact ttgcatactc tgtgagctga ataagttttg gaagtagatc agactttatg 32220 

tcttttagct ttttcatttc atatctagga gataaagctg atatttttat tgagacacca 32280 

ttaattgtat tttttgtcag atttatcttg ccgacttcat ctatagcatt cttataagac 32340 

tgataataag tgtcggcttg ttcagcgttt cttgctgcct ctccaagcat gtcaaatgaa 32400 

taaatttcat tttctatatt cttaattttt tttatgtcat caaaatctct gcccataaca 32460 

aattcttgac tgagaatatg catggcacca actacagcat ttctaattgg aaactcgcca 32520 

gattttgaga ttaaagaact taatagggcg cttgggtttt tggtccactc atcgggtgta 32580 

gaaactacct tgcctgcaag aagaagaccc catgttgatg cattaacgaa aacactatcc 32640 

gctttgttta aatgctctat ccacgcaccc tcagataact tctcagatat tattagatcc 32700 

cttgtttttt tgtctgggat tcttagtatt gattcggcta aacacatcag agcaacacct 32760

tctttattat ctagcccata ttcgcttaga aatgcatcta gttttgttct ctcgctctta 32820

ttctctctgc aagcatcaat tattttattg agcatttttt gaaatcgaag ggtcatttaa 32880

aaaatcagaa ttactaataa gatctgaaac tatttcttgt tcaggataaa acttgttact 32940

tgttaatgtc atagctatta ttttaatctt atagtgccaa gtatcaactt actgattcta 33000

ctaatgactt aaaaacatta tgatagtcat atgagtgatg cacttaaatt aattaaacga 33060

C&9 aacc £T ac 9* agatcctcac agagtctgat ttaaaaaaga aattagattc tggaaaacag 33120

ctaatcatta aggcggggtt tgatccaaca gctccagacc ttcatttggg tcataccgta 33180 

ttattaaata agctaagaca ttttcaagat cttgggcata aagtaatttt tctaattggt 33240 

gattttactg gtcaaattgg agatccctcc ggtaaaaata aaactaggcc aacacttact 33300 

tcagaagaat taatttcaaa tgcaaaaaca tatgagaaac aagtttttaa aattcttaaa 33360 

aaagaattaa cagaagttaa atttaattct gagtggtgca acaagcttgg tgcagatggt 33420 

ttgattggtc ttgcatcaaa atataatgtt gcaagaatgc tggagcgtga tgattttaat 33480 

aaacgttata gcgcaaatca aagcatagct attcatgaat ttttataccc ccttgttcaa 33540

ggatatgact ctgtagccct agaggctgat gtcgaatgcg gaggaacaga tcaaaaattt 33600

aatttgctag tagggagaga gctacaaaga tcctatggtc aagaacctca ggttgtttta 33660 

actgtaccca ttctagaagg cttggacgga ataaataaaa tgtccaaatc attaaataac 33720 

tttatagcaa tagatgaaga gcctaatgat atgttcggta aaataatgtc tatttcagac 33780 

gagttaatgt ggagatggtt tgagttactc agctttacct cagagaaaga aataggagtt 33840 

cttaagaaga aaatggaaga agggaccaac ccaagagata ttaagtttct tttagcagaa 33900 

gagttagtag atagattcca ttcagagggt gatggttcga aatgcaagga agcttttctt 33960 

caaagatttc aaaaaggtca aatgcctgat gacattcctt ccatgtcagt tgatgttggg 34020 

gctgagggca ttccattagt aaacctttta aagaattgtg agatgacatc aagtacatct 34080 

gaggcgatga gacttgttaa acaaggcgga gttaaaatcg actctgtaaa aatagaagat 34140 

cctaaaatgc taatttcaaa aggccaggag tctatttatc aggttggcaa aagaaaattt 34200

ttaaaaatta aaacataatg aaaaataaat tagtccaaat atttcttctt ctttttatag 34260

ttgcgtgtaa tcaggatagc cctaatatta aacaaattag taatatgcag tattttattg 34320

ataatgaaat aagggaagga atctattctg ttgaaccagg cttgcaatac tcaattatcc 34380

|£aaatggaga tcaaagttct gaatcaccat tgctgcagga tacaattaca gctcattttc 34440

Jcgggaccct cactgatggc tcagtttttt ggagttctgt tgaaatgggt gagcccttaa 34500

cagtcgaact atcaggtcta atagttgggt gccaaaaaat aatctctatg atgaagaaag 34560

gtgatgaatg gagagtttat atcgacccaa gtatggccta tggcgatgag ggccggcctg 34620 

ggataccttc aaactcaatt cttatttttg atattgagtt attagatatc caaaaaaact 34680 

aacccttatc tataacggat agagcgtatc cataaacctc agcaacctgg ttaataattt 34740 

tatttttagg tgttccggct ccatgaccag ctcttccttc aattctaatt aatattggat 34800 

tatcacaccc ctgggactct tgaagttttg ctgcaaactt gaatgaatga gagggtacta 34860 

ctctatcatc tcttttagct gtagtaatca gagtagttgg atagcactca ccttcaacaa 34920 

tattatgcag aggcgaataa gctaataagt tttcaaactc atctttctta tctggagagc 34980

catagtcact ttcccaggcc cagcctatag taaatttatg aaatctaagc atgtccaaga 35040

cgccaacttg tggaattgcg actttaaata aattaggatt ttgcaacatg gtcgcagcca 35100

ctagcaaacc tccattagag ccaccttgaa tagcggttga tgatggagaa ccaatttctt 35160

gtgcgtgtaa aaatttagca gaatatgcaa agtcatcaaa aacattttgt ttattaaaaa 35220

gcctgcctgc atcatgccag ttatcaccat attcaccacc acctcttaga ttaactacag 35280

caactattcc cccttggttc atccaagtga ggtagctttt actaaagcca ggtagtcttg 35340

agatattaaa gccaccatat ccatataaaa gtattggagt attactgtct atttttaagg 35400

actttttgta actaagatgg atgggtattt gtgttccatc tttggatgga aagaatttaa 35460

agtcagatgt aaataaagtt gagtcgtgtc ctttaagatc ttctttccaa aaaagttctt 35520

gjtgacatatc agttagatta attttgtata tttctcgagg agttacaaaa tttgtaaatg 35580

gjaaaataaga tacctcatct tcaatttcac caccaaaacc acccattgtt ccttttcttt 35640

feytgttgctag cttattttta tatgctcctt ttaaatcaaa gaagtgaacc tcagtaaaag 35700

featcaacgag ataagaaact acaatagaat tatttataaa gctaacacta ctaatagaat 35760

tagtactttc accaacgact tcattccaaa caaatgagcc attttttatt gtaagtgata 35820

fjbacttttcc gttcgcagca ttttcagttg agtaaaacca aaaagtatca ttcttgcttt 35880

gtaaaaagct ataagcgcct attagctcat ctattaaagg aataaaaggc tgatcagggc 35940

ttagctgaac atagagtcta tttctttcat ctgtgccttc accaatagat agaaacttaa 36000

ttttagaatc ttttacaaca cttattcccc aactccatct tggcttttca ggattctcat 36060

aaacaatcac atcctcatct tgagcagtac caattttatg aaacattaac tttggagctg 36120

tattaatatc ttttaaaagc tcttccgatg gctcgtcgta tttttggtaa taaaatccag 36180

aatcatcatt ttcccatgaa gcaccagaaa atttagccca ttcaatccta tcatcaagag 36240

tttttcctga ttcaatatca agtactttcc aggttctcca atcagaccca ccatctgata 36300

ttgagaaagc tagaagagat gcatcattac taacgctggt acttgcaaga gagatagttc 36360

catcttctga gaactgattt ggatcaagta aaactctatc ttggcattct tcacagtcct 36420

taatcatcag cttgctttgt tgccatgacc catcattgaa ataataaaaa gtttttttgt 36480
taacctgata aggcatgctt atcgaatctg tatcccaaac ctcatctaaa ttcttagcaa 36540
tagatttttt atatttattt tggcctataa atttttgtgt gaaattattt tgtctctcga 36600
cccagtctgt tgagtcctca cttgtaaaat cctccatcca tctataagca tcttcgatta 36660 
ggtatccatg gacttcttca ctaaaaggaa ctttatttga ttcagggtat tcaaaattat 36720 
ctgtttgatt tgagcaacta actagaatta atagagctgc catgtatatg tatttattgc 36780 
gcataaaaag cataatactt tacaattacc ttatgaatca attcaatcga gcatggcaac 36840 
ttttacgcaa aaaattggaa gttttttatt gtacttgccg cacccatcat gatgcttgaa 36900 
atagcaatgg catcaatggt aaccccaata caaaacgtta ctcagcctga ggatattcta 36960 
gaatttttca atgagaatat agcctttcta ggctctgtta gtctcttagg cgtagtttta 37020
agtatggctt ttatgggagc gctttttgtt tcgtatgcat ccatagaatc agaaaatgag 37080 
attgagcctc taaatgcctt atttttaggc ataagaaaat tctttccact tctgggggct 37140 
tatcttatag cttcagttgg tgtctttttt ggtatcttat tattaatact cccagctttt 37200 
tatgtagcag caaggctttg catttttcct gcatttatta tgcttgaaga caaaggagct 37260 
atagaatctc ttaaattatc ttgggaaaag acggatgagc atggcaccac tttgtttggg 37320
cttaccatta ccttcttttc tttaacaatg atttttgcat cagttgccca atccattata 37380 
agtccgggat taatgcaatt agttgttctt gcaattattg aatatgtaat agtaattcca 37440 
tggggctatg tatattttag tttatacaag tcattaaaaa gatattaggc agattaacta 37500 
gaataaatta atacgaagct tttcgtttac aaaaaaacac accactctta taatccttcc 37560 
tccgaagatt tgcactttgt taattcatgc aaacaacttt gggtctgtag ctcagcttgg 37620
ttagagcgca cccctgataa gggtgaggtc ggtggttcga gtccacccag acccaccatt 37680
ctttgaacat taacttttat ttcctttata cttattctta actaaataag gatagtatta 37740 
tggctaaagg tttagacaag caaaaaaacg acaagaagaa aggcaaaact ctaaaagaaa 37800 
aaagagcggc aaaaaaagaa aaaaagaaat agttttttgt agaagactat tattatctag 37860
tcgacctagg gccatcttta aaatctattt taaagaacaa cattattccg gtcactgagc 37920

ttactaaagc taaaatagaa aatattttta aaagaaggtt gtcgatatta tctctttctt 37980 

tccaatccat aatatgaaac ccccacatca gatcccatat tttccattta ttagatctaa 38040 

tggccacaac ctctgctgag taaacattaa tgtaaacatt taattcagta cctttaacat 38100 

ttttactttt tactctataa attggcaatg atctgcctct atattcagat cctgattttt 38160 

catttacaac ttcttcggta gcaaatggca aaagagttgt ttggcttgaa actgaatcta 38220 

tagcatcttc catagaaatc tttgttaaag gctgcccaag catattcaga tacttagttg 38280 

agccctttgt tgtgatgata acaatctctt gacctaatct ttttttgaat ttaacttctt 38340 

tagctttctc tatttcaaaa tttaacttac ttaaatcaaa agatgtttca acatgattaa 38400 

rjjgtattgctc gcctcttacg agctcaatct tattaaaagc aaagtatatg cctgagatag 38460 

tccaaagtaa taactgaaga gagataaaaa aacttaggta cttatgaatt ttcctaacta 38520

aaaagttcat cttttcataa tcttcattat ttcatcaatt tttacatcaa attcgtcatc 38580

accttttaaa gattccttaa tgcattcacg caaatgggtt tttagaattt tgccctcaac 38640

agtaataatt gaattcttaa gtgcttttat ttgattaaga atatcaacgc agtattgacc 38700

ttcctcaacc attccttgaa cgcctcgcac ctggccctct attcttttta agcttattaa 38760

ttgctcttta tgacatggat gtttcatttt ataaaaacct cggaactata tacgtaaaaa 38820

ttgtagcgaa taaaaaaatt gctacagatg catagtacag atacctagat ttttttctag 38880 

tttctgaaca tgctctgcct agttctgggt cagcagggca aggcatatga taagttttgt 38940 

agttgatata tcctgcaatg acaatcatta ctaatgcaaa tagtgttatg taaagcttat 39000 

attgagatag ggtaattaag aatggaaata cagttaccag gcttgcaaaa cttgccccag 39060 

caccaagagc aacaaaaata gctggcaagg cacaacatat caaagtagaa gaagatgcaa 39120

ataatgaaaa aaaattcgac gccttatcat tcatagttat taaatttcta ggatttgaag 39180

atcagacata ttttggccat tagaaagaat cttcttttcg atctcagaga aggtgatttc 39240 

ttttgcctta gaaaaggcaa tcactacttt tccattatca agatcaacat ctattctttt 39300

gacatcttta tcttttagaa aagtttttga aatacctctt gcacagaaat cacaaaccat 39360

tcccttcacg cttacgatcg ctatatttac atttgagagt ccttcaacaa atgcatcaaa 39420 

cctctcaggg tttacatcaa gatcttttcc atcaaccatg gtttcatgta aatgaccttc 39480 

atgagaatgc ccatgcatac cactatggtc attctccgcc gcaataaggc cgcttgaaag 39540 

caaagtaatt aataataatt ttttcatatt tttctccttt taaaatctgt acataaaatg 39600 

taaatcccag ctcgtatctg cgtcatagcc aaactcaaca agggcattac cataaaaaaa 39660 

tttcaattct ggatacgtag atgatttatt tgttacagta ttttttttgg ttttaaccat 39720 

aacccatgta tgtaaatcat tgtagtcagc aacgtaagga gcaaatccaa gttgaatata 39780 

ttcttcttta tagtctttgc taaatttact gttaatatcc tttaacccaa agcccgcata 39840 

ccacttccgg gtttcccagt ctccatggat accataaaaa ttatttttaa tacttccagg 39900 

cacaattcct gattgaaaat ataaattcct ttgggatgta aaagtatttt ttctattcac 39960 

caaataagta aaacgaaaat agttttctgt gttcttataa aatttatttt tagccctttc 40020 

taggccaatg gaatatttgt aggttggaga gtagtggaaa taaatagaat cattaaaggt 40080 

atctgattta tacattatgg ttgttccgcc agagtaggat atcggtcttg catcagcatt 40140 

aaagaccatt aaaaaaacta aaatgattat tttatatata cccatggggg gtatcttacc 40200 

ataggattat aaattaattt caggcatctc tccaagcgcc cattttcaat gaaatcacca 40260 

tcagaattcc aaaaataacc atagctatag cagatgcata aaaaatacta actagcccat 40320 

atgagagcca gtaaacagct accaatgcta taacagaaaa tactataagc ctgatggccg 40380 

tgattataat tggccatttc attgcattgg ctccctgaga tgcaaaatac agggaaagcc 40440 

catatccttg aaatacataa catacaccaa ggatttgaat atactgcttt gtaactaata 40500 

aggtctctgg atccgatgta aaaatactaa tccaaagatt tggtgttagt gctaaagcta 40560 

gaccgataac agccgaaagc aatccagctg ttgttgcacc aaccatgcct attttttctg 40620 

ctcgctctat atttttggca cctatatttg ttccaaccat ggctgtcata gcagtcccaa 40680 

tgccgaacac aattgggatt aataagaact ctacccttga accaattcca taaccggcaa 40740

tagcagacgt tccaaattga ccaatgagtc ctgtcagtaa aagaactgtt gcaactgtca 40800

ttattggcga aagagatgcg ggtaaggcaa cagaaaaaat atcttcgaat aattctcttt 40860 

caattgttag tcttttgagc ttaagcctta caggcgatga agggctcgaa atttttatta 40920 

aagttactaa tgccataaaa cctgaagtaa ctatcattga ccaagcagat cccacaagcc 40980 

caagtttagg tagaccgaat gatcctaata taaatccagc agagaaaaaa acttgtatac 41040 

ctgcacaaat taccgttagg actgcaggaa attgcatatc tcccatccct cttagggcgg 41100 

cggttaagct acctgatagc caaacaacta ttgctcccaa taaatagaca aaacaataag 41160 

ctaaagactc ctctaataat gcacctgttc cacctaatat tcttaataga ggctctccaa 41220 

aaataaaaaa gataattaaa aaagctaaag ctccaaagca agatatatac aaagaatgcc 41280

ataataattt ttcagcccta ggcttgtctg cggctccaag acttcttgca atagaagatg 41340 

tgaccgctcc tccaagagca ccaaatgcca tctgctgagt cagcatgatt gcaggaaatg 41400 

ccaaagtaac agcagctaat ggagttatgc caagttgact aataaaccag aattcagcaa 41460 

gaacaaccac agcatttatt aaaaatgcaa cagtattagg tgctgacatt ttaatcagca 41520 

aaggaaatat tggatctttt aaaaattgct ctgttctctt gtccataacc ttctcattta 41580 

ttaatattta tttggttaaa gggtgaaaga gaacacgttc tcaggattca cgaatcttta 41640 

aataataata acagctttta gttaatttaa tttttaaaaa caaccaaatc tattatctgc 41700 

tagattttat tttttaaaca tattaatatc taatttaatt gcaaaggggt ggccatttcg 41760 

gcctgagatc cttggaaact ctttgaacct gatccataca ataatggcgg aggaattgca 41820 

tgaaaatcat taaaaatcat ttattttatt caataatctt actatatttt tttagtctag 41880 

agacgagctc tcaagctata gaagaagtta ttatcaaagg agactggaga gaaactagtc 41940 

tgtcagcaga agactcaagc attgcagtcc ttgatagtaa attaatagaa tcccaggctt 42000 

taaagcactt tgaaaacctt tcgtacctag taccaaattt aaattttgct gcaagtgatt 42060

ctagagcaag acatttccag ataagaggaa ttggagaaag atctggctat gaaagaactc 42120 

caaactctgc agtaggttta ctgattgatg atatagactt ttctgggcaa ggtggaattg 42180

ccacaacttt tgatgttgat caaattgaag ttcacagagg cccccaagga gcaagaatag 42240

ggtctagcgc aatggcgggg ctcatatata tttctacaaa agacccaaca gaaagctttg 42300 

aagggaaggg tgaaatagtt atggggtcat atggaacttt taataccgga attgctgttg 42360 

gcggtcctgt aaattttaat aaagacctca cttataggtt agctataaag aaagattatt 42420

ctgacggatt taggaagaat atttttctca acaagtctga tacttctaag aaagatgaga 42480 

gcacatttag attaaaagtg aattgggtaa ctgataacca aaccacttat aagtttctta 42540 

tatctcaaat agagttagat gatcctgcag atatatggac tatagatggg agtcttaata 42600 

cattatctga taagcctgga atggactctc aaaaaagtaa tgcctatggg gtaaaaattt 42660 

atcatcaatt taaaaaattt gaatttcaaa gcctatcaag cttaacgaat actgatgtca 42720 

ttcttagtta tgatgctgat tggggcaacc ctgagtcgca ttcacccttt atttatgact 42780 

atttttcaga aactacaaga aaaagagata cttttagtca agaatttaga cttgtatccc 42840 

aatttgcaga taaaaataca gaaaaaagca tcgaatgggt cgttggggct agttttgtag 42900

atataaatga aacaaatgct aaaaaagata ctggtattta tggagatcca tcagacccat 42960 

atggtcctta ttttagtaac tcttcttctt tgagtgactt ttcttcttca agctattctt 43020 

tatttggaaa tattgattat ttaattaatg aaacaataaa aatttcaatt ggtggaagat 43080 

gggagaattt taaatctaat tattttgatt cttatgacga atcattttca ccatcaaata 43140 

aaatgtctgg tggtaagttg tcactagtta aaactcttaa taataattct aatatttatt 43200 

ttaatattgc taagggctat aaccagggtg gatttaattt aggtcttggt cttgataaaa 43260 

attcatcaaa tagaaattta tattatgatc cagaattttt aactaattac gaagttggaa 43320 

ttaatagtaa atttttccag tcaaaattaa atcttggagc agtcctgttt tattctgatc 43380 

gaaaagatca acaggtctta atttcaaccc aggttgatcc ttcagatccc aacacttttt 43440 

tatacttaac ccaaaatgct gcagaaggaa ttaataatgg cttagagtta aatatagatt 43500 

atgcactgaa taaatctcta ggtatatttg ctaattttgg attgctaaat acagaaataa 43560 

aaaattggat ttcaagacca gatatagaag gtagagaaca ggcgcatgct ccaaaaaata 43620

gtttttcaat aggcatcaat tggaagccaa caaaccaatc ttatttatca ttgaatgttg 43680

ttggtaaaag tgagttctat tactctgatt cccataacaa tacttctgag tcatacaact 43740 

taacaaacat taattatgga tatgaacatg gacaatggac ttattcatta tgggcaagaa 43800 

atatttttga taagtactat tcagtaagag gtttttactt tggtaacgag gcacctgatt 43860 

ttatagatac gctctacaga agacatggag atccaaggca tataggagtc atggtccaat 43920 

atgatttcta actttataag tgaattttgg atggaaatag cagctgttgt atttgctatt 43980 

atttatttat tacttgctgt aaaacaagac gtaaagtgtt ggtttgcggc tataattagc 44040 

tcaatattat atttctttat tatgtatgac gcaggcttat atatggaagc ttacctgcaa 44100 

attttttata tcatgatggc cttttatggg cttcaacagt ggagaagtgt cgatactgat 44160 

gctcctcaat ttattgttag aacatgggat aaaaggatgc atattaagat cattacatta 44220 

atagttatca tgactttaat ctctggattt ttattagaaa aatataccaa tgctatcttg 44280 

cctttcatag atggactaac aacatgggga gcaatagttg cgacatatat ggttgccaaa 44340 

agacttcttg aaaattggat atattggttt gttatagatt ttatttctat ctttttattc 44400 

atgtccagag gacttttatt aacctcagga ttattcttta tttatcttgt aataatatat 44460 

tttggttaca tgtcatgggt aaaaataaga gacgatatta gtgcagaatc ttcataacaa 44520 

tctagatata aaaaatcata atctagagat tattaaaaca attaaatcag gccctgtatc 44580 

tgaaatatct atttgtaatt ttgataacat caaagcaatc ttaagagttg atcatccatg 44640 

cgcacacaaa ataaatgtag atcgagaaaa tgaaattttt atactcagcc aactaaaaat 44700

tttagatttt agtccagagg ttttatttag tgatttgtct tatggaattt tagtatggag 44760

atatatcgag ggtattgaat tttcacttgg taaggattcc aatgaagttt ttttaaaaac 44820 

acttggaacc gaattaaaaa aaattcatga tattgatctc cccaaaagta agaaaaaata 44880 

ttttagcaat gacataaatt tttacagaaa tttactaaag gaggttcctg aaaatataat 44940 

tcttcataga ggatttgatt tatacgacaa actcaataac tctgataatt atgttctctc 45000 

tcataatgac ttgaataaga caaaccttct ttggagggat aggttatttt ttttagactg 45060

ggagtactcg agttttaaca atcctttttt tgatattgcc tcgttatcga atgcttataa 45120

cttatcaaaa gttgataggg caattttatg gaaagcttat acaaataatg aatattcagt 45180

attaaatgat acaaatctta gagaatggat gcatttttgt cattatttag agtacatgtg 45240

gagtatttcg ctcatacaaa atggaaaaat tgatcagaac accctaaatt taaaaaaatt 45300

agagaaaaaa ttaaaaaata ttatttaaac aacaagtatt gcgcattctt tagttgttgt 45360

tttactatta tgtaagaaat ttacatttaa atgggggttt gatatgaaaa ttttatgtgt 45420

cttatatgat gatccaaaaa caggtatgcc agaaaggtat gcaagagatg atttaccaaa 45480

gttagataag tatcctgatg gaatgacact tccatcccca aaatctatag attttactcc 45540

tggtgagtta cttggttgtg tatctggaga actagggctt cgaaagtttc ttgaagatgc 45600

tggccataca ctagttgtta cttctgataa ggatggagat ggatgtgagg ctgataaaga 45660

attagtagat gctgatattg ttatatcaca acctttcttc ccatattatt taacaagaga 45720

caagatgaaa acagcgccta atttaaaaat ggcaattaca gctggcattg ggtctgatca 45780

tgttgatctt caggcagcca tggataatag cgtggatgtt gttgaagtta cttactgtaa 45840

ttctcgttca gtcgccgaac acattgtgat gatgatccta tcgatggttc gtgattatca 45900

tacccaacat agaattgtaa aagagggagg atggaatata gctgatgctg tacaaaggtc 45960

atatgatgta gagggcatgc atgttggtac tgtagctgca ggtagaattg gtattgatat 46020

gctaagaaaa atgaaaccct ttgatgtgca tttgcattac tttgatattc ataaactctc 46080

tgatgaaata gaagcagaac taaacctcac ctatcatgat tctgtagagt cattagttgc 46140

tgtgtgtgat gtagttaata ttagttgccc attgcatcct aaaactgagc acttattcga 46200

tgatgaaatg attagtaaaa tgaaaagagg tgcatatatc atcaatactg ctcgtggcaa 46260

gatttgtgat aaagatgcta ttgcaagagg cttagagtca ggccagctaa gtggttatgc 46320

tggtgatgtt tggttcccac aaccagctcc aaatgatcac gtatggagaa caatgcctaa 46380

ccacggcatg actcctcata cttcaggtac ttcgctatct gctcaaacaa gatatgccgc 46440

tggggttaga gaaattctag aatgttattt tgcaggcgaa ccaattagag acccatattt 46500

gattgttcaa aatggtgatc ttgcaggtat gggtgcgcac tcgtacacaa agggtacagc 46560
cacagatggc tcagaagagg ccgctaagta taaaaaataa gttttagaac ttacttagcc 46620 
tttccttaat aatagagtct gcctctgcca taatgctatg cattaattct tccactgtcg 46680 
ggatgtcatt aaccagtcca gcaaccattc cacatgacca ggctccaacc tccatagttc 46740 
cttcatgcat aatttttgga tagactcctg caacctcatc cacaatatca gcaaaagtta 46800 
attcatcacc aagagctttt tctttttcaa tcaatctctc aacagcttca ttattaagaa 46860 
ccctttctgt atttgttaat gatctcatga tgagtctagt atctaactca gaagcattta 46920 
cgatagcctc tttcacattt tgatgaacag gtgcatcttg agtagcaata aacctagttc 46980 
ccatattcat tccctcagca cctagtgaca tggcagcaac caaacttctt ccatctgcca 47040 
ttccacccga ggcaacaaat ggtatttcaa gctcatctgc tgctctaggt aagagtatga 47100 
aattaggaat gtcatcttct cctgggtgtc cgccacactc aaaaccatct acagaaaccg 47160 
catcgcaacc tattgcttgt gcttttaatg agtgccttac agaggtgcat ttatgaataa 47220
cttttattcc tgcttctttt aaagctggga ggtattctgc aggatttctt ccagcagtct 47280 
ctacaacagg aacccctgca tcaataatta cttttatcaa accaggatag tctgggggtg 47340 
ttagtgatgg taaaaatgtt aaattaacag caaatggctt attggtcatt tctttgcatc 47400 
tggcaatttc atttgctaat ttctcaggcg taccctgtgt tagaccagta attgttccaa 47460 
gtccacccgc atttgatact gccgctgcaa gctcggcaaa accaacatgg tgcattccac 47520 
cttgaatgat tggatgttct ataccaaata attcagttat tttagttttc ataatactct 47580 
cctatttatc ttccattggg gttaaaaaat cttcatattg agtttgaagt ctttgcatgc 47640 
cacttatcca tctatctctg tcattacctt ttcttttaac atattctaga acctcactat 47700 
gaggggttac taaaaacctc tcctcctcta tggcgttaag aacatccttt gcaacaatat 47760
cagcttccat cattccatca actcctgcta caccaggacc atttgcagtc atagcagttc 47820 
taactgcttg agggcataaa caggaaacgc cgataccttt atttccatat gtaattttta 47880 
tccattcagc aaaactcaca gctgcagctt ttgtaactgc atatccagcc gcacctagtt 47940
gagttaaaag accggctgct gaggatgtat tcataagata gccttcacct tgttctatca 48000

tttgaggaag cacatgtttt gcagcatgaa tatgagactg aacatttaca ccccatatca 48060 

tgtcccaatc cgaagtatct gcttcaaaaa atcctggctt tccaccgata cctgcatttg 48120 

aacaaaatat atcaatacca ccagaaaatt cattagcctt ttgtataaca tttataatgt 48180 

cgttttcttt ggaaacatca gcacttactg caagcccatt aacactcttg gctgtttcct 48240 

ccgccccatt taaattcata tcaacacata ctatagaact tgccccagat gcatagaact 48300 

cttcacataa agccttacca attccactgg cagcccctgt aaccacaact cttttattat 48360

taattttcat aaattgacca cttttttaat tttttattta actttattgc ttagatagtt 48420 

tattacaaaa caacaatact taatattgca aaatcgcata cccatcttta taaatttatt 48480 

gctattatta gtgctagaaa atgaaatact gagatttaat atgaagaatg ttgttgttat 48540 

tggctcgtcc ggggcaatag gaaaagcctt tattgatagc tatatcaaag atgatgatgt 48600 

tgaaaatata ttttcatttt caagaacagg cctttccatt gaggataaaa aactccatag 48660 

tttttttatt gatattgagg atgaaactag tatttgtgat gccgcagaga agatagacaa 48720 

gtcctcaata gatgaaatta tcgtcgcaag tggaatactt cataataaag attttgggcc 48780 

agaaaaaagt attagagatt taaatgcaga taacctttta aaggtcatta aggttaatac 48840 

tatcggccca acaattgttg gaaagtattt cattccattg ctaaataaaa aagaaaaaag 48900 

cgtcttagca tttttaagtg caagagtcgg cagcatttct gataataaaa caggtggttg 48960 

gtatgcctat agagcgagta aaactgcact taatcaaatc attaaaagtt ttagtattga 49020 

attacgaaga accaatccaa atgccattat ttttggtcta cagccaggaa cagtagatag 49080 

tgaattaagc gaacctttta aaagaaatgt aaaagaaggt aatttattta ctccagaata 49140 

tagtgtattg cagctaaaaa atattattga tacagcaagt ccatctgatt caggcaaact 49200

aatttcttgg gatggggaag agattcagcc atagttggtt atgaatatat tttcatatta 49260 

gaataaattt ttaggggaaa aaatgagtat taaatattat gactggtcta aatttcaggc 49320 

caacactagg ccaaataaag ttgccataag agagctagat aataacaaga tctatactta 49380
cggagaattg gataaaagat catcaaggct tgcatcacat ctccaaagtt caggaataaa 49440

aaaaggagat cgtattgcga tactatcgct aaattgttca gaattttttg agctagaatt 49500 

tgcttgcgga aagattgggg caatagagat accattaaat tggagattaa caaaacccga 49560 

gctcagttat attcttaatg atagtgagcc aaaaactcta atttatgaca atcagtttga 49620 

agaaatggtg aaagagctaa aagaagaatg taatatttct gaaatcatag ctcttgatca 49680 

atttgaccaa gaaagtgatt atgaaaaagt tttgagtaat gcttcaggca tttattatca 49740 

ggaagaagtt gatctagaag ataacattat gattatgtat acctctggaa caaccggtca 49800 

ccctaagggc gccatgatca cgcacaaaat gcagcttttt aatgttatta atttaggtat 49860 

ttcagcagct gtttcccctg aatcagtcca tttagttgtc cttcctttat ttcatacagg 49920 

cggaatgaat tgttattcaa atccaattct tcatgcaggt ggcgagttaa tattacttaa 49980 

agagtttgag cctgggaaag ttctatcaat tatcggcagc tctgactatg gagttactca 50040 

tctgtttgca gttccagccc cttatcaatt tatgatgaat catccagatt ttgaatcaac 50100 

aaatttatca ggagttaagt atgctggagt tgggggcgca ccttgtgcag aggctatttt 50160 

gaagacttat ataagcaagg gtgtttcgat gcagcaagga tggggtatga cagaaactag 50220

tccaggtgct actggtcttg aatcgtccga ggctgaaaga aaaataggat ctgctggaaa 50280 

accgcttctt catactgagg tcaaggtggt tggagatgat gggaatgaac tgcctgctgg 50340 

agaagtaggc gagatttata ttaaaggccc aaatattaca cctggctatt ggaagaaaga 50400 

agaggctact agagattctt ttgaagacgg gtggttaaaa acaggtgatg ctgcttactt 50460 

tgatgacgag ggttttttat acatagttga tcgatggaag gatatgtata tctcaggtgg 50520 

agaaaatgtt tatccagctg aagttsaaaa tgttatctat cagttaccac aaatcgcaga 50580 

agttggagtt attggtattg atagccctaa gtggggtgag actggtaaag cctttgttgc 50640 

tttaaagccc gatcatgaat tgacggcaga agaagtcata gatcattgtt taaaaaatct 50700 

agcaaagtac aaaattccag agaaagttga gtttattgca gctcttccaa gaaatgctac 50760

aggtaaagtt ttaaaaagaa cattaagaga tatgtaatat ttttaagcca aaaaaaaccc 50820 
agcctaagct gggtttttta tttaactatt taaagttatc taatgttaag cattagaaga 50880

ttctttaaca gcaacattcc atataattaa accaaataga atcttgttaa caaagtcagc 50940 

aaggttatag ataaggttta agttaagagc tgatccaccg tcacccatca ggtaacctgt 51000 

gaaataacct acaggataaa tcgcccaacc aaagatgata atatacatca ttgtgttgta 51060 

agctgattgc acagcaggac ttgcagtatt acatgcagat tttccttctc cagcccataa 51120

ttcataaatc atgtataccc aagctaaaca cccaataatg aatgcaggcc atgcagccat 51180 

gattcctgct tcacccatgt aaccaaacac aagcataaca agagaaccaa ctagtaattt 51240 

cttaaataat gatccagcaa cattagttgc agcagcaaga attaagtaga attcacatat 51300

taatagagga actgttagta accaatcaat gtatctaaat acagttggcg aatcaccagt 51360 

ttcaatccat acccctctca tgtacatgta atgccagaaa gcaataccag taacaagacc 51420 

agatacagtt aatgatgttt tccattttgc agaaactcta tctctttcaa caaagaaaaa 51480 

tacagtagat gctaataaag cagcagtaac taaccaaaaa gaaacaccag tgtaatcact 51540 

agcatcaagg tcaccaccac ctgcagcaaa tgtaggaagt gcaataacac tacctaatat 51600 

cagtaataat ttcatatata actcctatat taatgacaaa gaagcataat tgcctccccc 51660 

ctaattaaat atgaaattta atagtaacca atattaatag aggttacaaa taaaaagcat 51720

ctattttttt aatagaaaat atgtgtaata aaacaccata ctttaaatta atatttattt 51780 

tgcataagta gacttgttta gtatcatatt taacatgaaa gttgcaattt atcctggttc 51840 

ctttgacccc atcacaaatg gtcacacgga tattattgat aggggttgcg gactctttga 51900 

caaggttgtt gttgcaatag ctaagagtga atcgaaaaac cctcttttta gcctagagga 51960 

tagaattaat ttagcccaat ctatttttaa aggaaatgaa aaagtagagg ttgttggttt 52020 

tccaagaaag ttaacagttg atcttgcaaa agactatgga gcttgtgcaa ttataagagg 52080

cctacgagca gtttctgatt ttgaatatga atttcagtta gcaacaatga ataggtcgct 52140 

ggctcccaat attgaaagca tttttttaac accaaaagaa agtctcattt atgtatcttc 52200 

tagcttaatt aaagaaatat cagacttaaa aggcgatata tcgaagtttg ttcatcctat 52260 






agtagagcag gcacttcgag cgagtagaca cttagctctg acaagcttca caaaaaaaag 52320
tagctctttg attgacgatc gttttacaaa tagtgccttt gcatttattg cagggctcgc 52380
cttctcttcc atatacattt aatttaaatt taaaatatcc tggactacca tcagctgagt 52440
agaagtcttt taatgtagtt cctccaacct ctattgcttg ctctagtatc ttttttccag 52500
cagctactaa tcttttacaa gcatctaaat ctaactcatt ggcatttttt agcggatgaa 52560
ttttagctaa gaaaaggctt tcagacgcat aaatattgcc gataccgact acattttttt 52620
gattcattaa atagctttta atatttgttt tagagtgact gcatcctgaa aaaaaatcct 52680
ttgcattaaa atttttagaa agaggttcag gcccaaggtt ttttattaac ttatgtttat 52740
cgatatcaga agtaagatgc attgatccaa accttctggg atcattataa ataattcttt 52800
cctcatcaaa aattaattca atatgatcat gtttgataaa gaagttttca ttattttttg 52860
caattctaag actcccagac attcctaaat gaagaataat ttttttatca ttagatagtt 52920
taaaaattat atattttgct cttctttcta agctttcgac aacttgatct tttactgagg 52980
tttcaaagct atccaccact ttccatctaa ggtttctgtt gtgtatcctt gcttctttca 53040
gcaaagagcc tttaaatttt ttaattgctc tcagggttgt ttcaacctct ggaagttcag 53100
gcatttttaa ctgagtaaat tattaatttt tactatgtca gctggagtaa ttgttcccga 53160
tgcgagccct agtcttagat tggcaagtat gtaatcatat tttgcattag caagattttt 53220
ttccgcgctg tataagtttt tttctgcctg caagagatca acaacgtttc ttgttccaac 53280
tctgtagcca acttgagtcg cttccagggc actagtggcc gaaatcactg cttgtttttg 53340
agcatttaca tttgcaacta atgttaaaac atttgaaaac tgggatctga cttcttgaat 53400
aatccttctt tctgtaaata gagtattttc atttgctctt tcatactgtg aatatgcttg 53460
cttccttctt gagttaacgg cgccgccttg aaagagtggc atacttagct gaattgcata 53520
attccttctt cctgttactg atggaactgg aataccttgg ccattgatat taaaaccttc 53580
atagttaaat tggtttgttt cagattctga ctgacttcca acaatgtcta tcttaggtaa 53640
atgatttgaa gctacacttc ttgcactgct tttcgctgct ttctttctca aatatgctgc 53700
ttttaactgg tagttatttt ccaatgctaa ttcaacccat gtctcttttg aacttggtgt 53760
tggcaggtca ataagcaaac catctcccaa ttcatttaag ctgaatattt ctctaccaat 53820 
cagagcattt aaagactctc ttgcagaata aagtgatcct tctgttctaa ttcttgaggc 53880 
tttacttaga tcaaatgcca attgagcctc ttgaactcca gttatggctg ataacccaac 53940 
atcgaatctt tgttttgctt gatcaagttg ttttttaata gctttttctt cagatattgc 54000 
tgcatttaga ttatcaatag ctctaagtac gccaaaataa agctcagcag ttcttactaa 54060 
aagattttgc tgctcaaatg caaagtctgc ttcagcagca tctgtaagag atttagattg 54120
cctatattga aaccatgtat ctagtctaaa gagcggctga gtaaccctgg cagatgtaga 54180 
aaaagagtta tattgctgct gcagttcttt gttttgatag tattcgttcc agttagttga 54240 
tccactcaag gtaatactcg ggagaagcgc agctcttcct tgaaccttaa gctctttatc 54300 
tgctaaatat gaatattccg ctgctttata tgtagggtca ttctcaagcg cttcattata 54360 
gatatccaaa agactttcag atgaaatatt aaatgagata aaaagtgcta aaaatgattt 54420 
tgtgtaaatt ttcataactt attttaacct attaataatg tttgcagtgc aaacatttat 54480 
tttttttaat tttattgaat ttattttata tttcttagag tagaatatct ctaacaagtt 54540 
caataatttt ttataataaa aaggctttaa aaattggcta aaaattcata tgacgctcag 54600 
gcaattgaag tcttatctgg actagatcct gtcaaaaaaa gacctgggat gtatacggat 54660 
acatctaacc caaatcattt aattcaagaa gttcttgata attcggttga tgaagctctt 54720 
tcaggttatt gttcaaacat aaaagtatct gttctaaaag atggctttat taaggtctct 54780
gatgatggaa ggggtatgcc aattgatgag cacccggaac ataaagtttc aggtgttgag 54840 
ctcatccttt gtaaacttca tgctggagcg aaattctctg gagatgatta taatttctct 54900 
ggtggccttc atggtgttgg agtttctgtt gtaaatgccc tatctgatga attagaggtt 54960 
agagtaaaaa gagattctaa agaataccaa atcactttta ataatggaga taagtcttct 55020 
gaattaaagc caattgggga agtggggctc agaaattctg ggacatcaat taaattcaaa 55080 
ccaaatccta catattttga aactatagag attcagataa aacagcttaa gcatttatta 55140 
aaggccaaag cagttctctg tcctggatta acgatagagt tcgttaatga aaaaaagact 55200
gatgataaac aaaagtggta ttttgaagat gggctcaaaa gctatttgat tgattcttcc 55260
gagggagcag acttggtttt gctagattca attgtatgct ctaaaaaatc tcaagctcaa 55320
gagcttgaat ttgcaatcaa ttggtcatta agacccccaa aaaataaact cgatgaaacc 55380
tatgtgaatc tcataccaac tgctcagggt ggctcacatt taaatggctt taaggctggg 55440
cttttagatt cattaaaaga attttgtgaa tacagaaatc tattgcctaa aggtttaaaa 55500
attaatgcag atgatgttct taataatgca atttttataa tttcatctaa gcttcagaat 55560
cctcaatttg cagggcaaac caaggaaaga ctagattcaa aagatcacat gtcgttcgtc 55620
tcaagtacca caaaagacat tttaagtatt tggcttaaca ctcatacaga agagggcgaa 55680
agaatagcag aacttgcaat tatgtctgct cagacgagag caaaagtttc caatatagtt 55740
gaaagaaaga aaacttttag aggcccagcc ttacctggaa aactttcgga ctgtaatagt 55800
caggacttaa atgaaacaga gcttttttta gttgaggggg actcagccgg agggtccgca 55860
aaacaagcaa gagaaagatc tttccaggca atcatgcctt tgagagggaa gattttaaat 55920
acttgggact tagaaagtgc agaaataata aaatctcagg agataaaaaa cctatcaact 55980
gcaattgggg ttctgccagg aaataatgac ctttcatcac taagatacgg aaaaatttgt 56040
attcttgcag atgctgattc agatggtctg catattgcaa ctttactttg tgcattgttt 56100
ctaaggcatt ataaatcttt agttcaagag ggaaggatat atatttcaat gcctcctcta 56160
tatagaatcg attctggtaa agatgttcta tatgcacttg atgataaaca gcgagatgaa 56220
atagttactg aatttaaaaa gaagaagggc aagcctaaag taaacattca aaggtttaaa 56280
ggacttggtg aaatgaatcc acctcaacta agagagactg tgatggaccc tgctactcgt 56340
cagcttgttc agctttctat cagctcaagc gataatgcaa attctatgat ggacttactt 56400
ttgtccaaaa agaacgcacc agcaagaaaa gaatggcttg aaaagaaagg gtctctagca 56460
aaaatataaa tatgaaagaa caaataacct caattagcct caagcaatat gctgaagagt 56520
cttatcttaa ctatgcaatg tatgtcattt tagatagagc tttgcctaat attggagatg 56580






gccttaagcc tgttcaaaga agaatactct atgcaatgtc agagcttggg cttgatgctg 56640
gctcaaagta caaaaaatca gcaagaactg ttggagatgt tataggaaaa tttcatcccc 56700
atggagacgg cgctgcatat gaagctatgg ttttaatggc tcaaaatttc tcattcaaat 56760
acccttttgt agatggtcaa ggtaactggg gttctcagga tgatccaaaa tcttttgctg 56820
caatgaggta tacagaatct aagttaacta aatttgcaaa tcttttaatc tctgaattga 56880
agtctggaac agtcgattgg cagcctaatt ttgatggctc tcttttagag ccagtaattt 56940
ttccagccaa actcccatct attttattga atggcacttc tggaattgct gtaggaatgg 57000
caacagatat tccatctcat aatattaatg aaattattga tgccacagta catcttattg 57060
ataatccaaa atcacagttg gttgatttac tcaagataat taatggtcct gatttctcaa 57120
ataattcgcc aataattgct agcaaagatg agctgaatga aatttattcg actggaaaag 57180
gcggcttcaa agctcaagcc caatgggcgc aggataagaa tcaaatcatt atcaacgcat 57240
taccttatca agcatctggg tctaaaattt tagagcaaat agctgatcaa atgcttaaga 57300
aaaaaattcc aatggtggtt gatcttactg atgaaggaga ccacaaggag ccagtaaggc 57360
ttgtcataac tttaaaatcg aacagagtaa atgctgaaga tgtaatgaat cacctttttg 57420
catcaactga tttacaaaaa aattatagag taaatatgaa tttgatttca ttgaaaggtg 57480
gaccaaaagt tttctcccta gttgatttat tgaaagaatg gctagtcttt agaaaagaaa 57540
ctgtaataag aaaactagaa catagactcg accaggtaaa cgataggctg catatccttg 57600
aggggttatt aattgtttat ttagatttag ataaagtaat aaagattatt agagaatcag 57660
atgagccaaa aaaagatatt attacagctt ttaaactttc tgatatccag gcaaatgcaa 57720
tccttgagat taggttaagg caactagcca agctagaaca aatcaagtta gaacaagaaa 57780
gagatacttt ggtagcagaa caaacagata ttgaaaaaat actaagctct aagacaagac 57840
ttaaaacatt aattaaaaat gagcttattg aaataaaaga tgagtttgga gaagtaagam 57900
aatctccaat aagagaagca acagaagcca aagttttttc tgaagaagaa actcttgtca 57960
ctgagcctat tacagtagtc ttgtctgcag ctggctggat cagaagtgca aaaggccatg 58020
agatagaccc cagctcactt tcctatagag gagaagatgt acttcaagat tatggaagag 58080

gaaagagcaa tcaagtttca gtttttcttg actcaaatgg gaaggcttac tcacttgcaa 58140 

gtcactctct tccatctgct agaggaatgg gtgatcctat tacaggaagg gtatctgcag 58200 

attctggagt aaagttcatt tcttcattga ttgggaatga tgaagataaa ttcatgatta 58260

tgaatactgc tggatatggc tatatttcag agtttaaaaa tatggtttct aataagaaat 58320 

caggaaaagc atttatgaaa atccctcatg aagcagacct tcttaaagcc attaaagtaa 58380 

gagacgatca tttgtatata gcagcagttt caaatattgg cagactttta atttttaaga 58440 

ttgatgaatt gccaactctt ggaaagggca aagggaataa aataataaat ataccaaccg 58500 

ctaaatttat agcaaaagaa gagttaatga cccatgcgca acttgtttct gaggctagct 58560 

ctttaaggat tgaaagcgga aagagattcc tcactttaaa actcaaagat ctagaaaact 58620

atatttctac aagagcaaaa aggggaaata tgcttccaca aggatatagg aaggtagata 58680 

aaatgattga agaggttgag ttagaagtta aagaagactg attatagatt tctcaaaaat 58740 

ttttaatccc tcatcaacta gctcattttc tattattagg cttggagaaa acctaacagt 58800 

agatgcattc gccttaagaa tcattaaccc attatgatgg gattttttta ttaaatcatc 58860 

tatttgaatt ttgctatcct tgcttacttc cactccgacc cataaaccag cagaagttat 58920

tttttcaaaa catttgtgct tctcatttat tttattcaac aaattgataa atctaacttc 58980 

tttctttaaa accttattta aaaatgattt tttagaaatg gtatctataa cttcattacc 59040 

tatggcacat gcaattgggc ctcctccaaa agtcgtgcca tgagaccctg ccgacatatg 59100 

ttttgaaact ttatctgaag taagaatgcc tcctagtggg aagccattag atataccttt 59160 

agcaaaacaa agaatatcag gagtaatgtt aaattgctca taagcaaaga gtgttccagt 59220

cctgccaatc ccggattgaa cttcatcaat tatgacaagt gccttatgtt tttttgcaag 59280 

tttctttatt tttgctataa attttttatc tgcttttgtt attccagatt gccattgaac 59340 

cagttctaat ataaccgctg cagtcttatc tgaaaaaacc ttttctagat tagtaatgtc 59400 

attatatgga tgatttttta tccctctagg cagaggcgca aaaccgtctg taagatgctt 59460 
agctttagct aaagctatac caagcatagt tcttccatga aatgacgttg aaaatgaaat 59520

gacctcattt ttatttttat taactgttga tgagcaaaat tttcttgcta tcttaactgc 59580 

agcctcaatt gattctgccc ctgaattgca aaaaaatact ttatctgcaa aagaattctt 59640 

gcataacttt cttgccaaag taactgaggg ctcatttatg tagagatttg ataaatgcca 59700 

aagctcctca gattgctttt taagaatttt tattagatct ttatttgaat gaccaaggtt 59760 

agtgacggca atgcctgctg tgaaatcaat atatttctta ttatttaggt cccatacatg 59820

cgatccactc gcttttttta caacaaaatc tgcaggagcg taaaaaggca ccatataatt 59880 

agtatattct ttaattatct ctttcataaa tatattgtat cgtggtttca gttcaaaaat 59940 

taaatgacca aaaatttcta gtatctttaa aaccaaacag ctcactcatt ggctttaata 60000 

gaataatatt tatatctagt atatctattg tatgcggtgg catagcattg atatttttct 60060 

tttttggagc aacgctcatt ttgccttttg ctggtttaga gcttggtatt ctattcactg 60120 

cattctattt aagttttaaa tggagtgata aaaaagaaaa aatatttatt tctcaagatc 60180 

ttgtaactat agaaaaaggc tctaattatg ctgaatataa atgggaagag ttcaggtcat 60240 

ttacctcttt ccaggtttca aaagacagaa gagatcttct taagctaagc tttaggtcta 60300 

aaggcgagga tgttgaggtt ggcagctttc taaatgaaga tgataaaaat gtattaatag 60360 

aagagctaac tcagatcata gatacattaa atcacgattc cttctcaaag ccagagcttt 60420

aatttctttt taacttcttt tagctttaat tcttttattt cagggatgcc atttctaaaa 60480 

tttggaaatg attctccctg tattaatggc ataagatatg caattgcttt tggggtaacg 60540 

tccattccat ttttagctat aaaagatttt ggtaattttt tttctaaatt agctattttt 60600 

gccagtggag ctggttcaat tttccaccta tattttttag ctttacctct aactattatg 60660 

ggcataacac cattcatgcc ttctactgca tactgaactg ccttggcccc aacagccata 60720

gcttgctcta ggtctgtttt ggaggcaata tgtcgtgcac ttctttgtag atagtcagaa 60780 

acagcccaat gattttttag ttttaattta tcagtaatta agttggcaat gtaaggcgca 60840 

actccaccaa gttgagcatg accaaatgca tctattgttg ctgattcaga aagaaatctg 60900 
ttcttattat ttttcaaccc ttcggatacc acaacaacac agtagccatt ttttttaaca 60960

acacttttaa cttctgctag aaattttttt tgattgaagg ttatttcagg taataaaatt 61020 

atatgaggtg catctccttt ttcttttctt gcaagagaag atgcagccgc catccagcca 61080 

gcatgcctgc ccataacttc taatataaaa acttttgttg atgttgctgc cattgaccga 61140 

acatctaatg aggcttctag agttgatgtt gctatatatt tagccgccga cccaaatcca 61200 

ggacaacagt ctgttaatac caaatcgtta tcaacagttt ttgggatggc tatacagtta 61260

attggataat ttaatttttt acttatctgc gaaactttaa atgcagtatc agctgaatca 61320 

ttaccgccat tgtaaaaaaa atatcctata ttgtgcgctt taaaaacatc aataagcctt 61380 

ttgtactctc tttcgcttga ctcaagatct tttaacttaa acctacatga cccaaaagcc 61440 

ccaccaggcc tatatttcaa tgactctaat gcagatattg attcttttga tgtatctatt 61500 

agttcctcat tcagcgcgcc caatattcca ttcttgcctg cataaatttt accgatgtct 61560 

ttatgcttct tggcctctaa aatcagagcg ccagctgtag catttataac tgcggtaacg 61620

ccgccggatt gagcataaaa cgcatttttt ttcatttttt ctccatcaaa tgtactgaaa 61680 

taaaaatatg taaagtatta taactaatat gcgaatacat atcttaggga tctgtggrac 61740 

ctttatgggc ggccttgcta agatacttaa agagtcaggg catgaaatat ctggatcaga 61800 

tattcaattt tatcctccca tgtcagacta tcttgatagc tttgatattg aaatgatcaa 61860 

gggctatgac ataaaaagca tgccagatgc tgacttgtat gtgattggta atgcactttc 61920

tagaggaaat gaatccgttg agcatatttt gtccaattct cttccgttta agtcaggccc 61980 

tgaaatgctt ggagaaattt taaaaaataa aaaggtttta gctatatctg gaacacatgg 62040 

taaaacaacg acttcatata tgttgactca tattatgtta gatcagggta gggatgtcgg 62100 

ttttctagtt ggtggaatat caaataatat atctggctct gcatgtcttg ggtcagatgg 62160 

aacttttgtg attgaggcag atgaatatga ttctgcattt tttgataaaa gatcaaaatt 62220

tattcactat tcaccaagca ctatagtcat caataatatc gaatttgatc atgctgatat 62280 

ttttaataat ttagatgaca ttaaaagaca attccatcat ctaattaaaa taattccatc 62340 
aaatggaaat gttgtttatt ttgctgatga taaaaatata agagatctta tcgatatggg 62400

aatttggtca aatcagatag caatcaataa taatgctcat tcaattgagg cagtttattc 62460 

tgataagact ctaaaatatg aagaaagtat ttattcatta aatgagttac ctttaatagg 62520

agaacataat tttaaaaact acatttcggc tattttggcg gcaaagacag atggaattca 62580 

aattcaggat tctatcaaat cattagctag ctttgatggc gtaaaaagaa gattagaata 62640

taaaggaagt tttgatggca tagaaattta tgatgatttt gctcatcatc caaccgcaat 62700

agaattttct tctaatgccc tcgtaacaca aaatccatca aaaaaaatac ttggtctcat 62760

tgagctaggc tcaaacacta tgtctggggg ttctcatggc ttgtctttgg ttgaatctgc 62820

aaaatcttta actcatgtta tctggctaga tcgcaataat gttttgtcag agaatgctag 62880 

cattgaatct actaacacta ctgaagattt tatttctgca gcgatatctg ctttctcaga 62940 

ttatgatatt gttattttga tgaccaataa agacagccaa aaaatattaa aacccattgt 63000 

agatcacttt gaaaaataat aatttaccag tttttccttt aggaatagtc gccctcccag 63060 

gtagcatcca atctcttcaa atttttgagc ctaggtatat acagatggtt aaaacatgtc 63120

tatccaagaa ccatggattt gtaattgttt ttaatgccaa taatgagtct caaggcgatt 63180 

tcactttttc taagaaggga agttttgttg aaattataga ctttaataat ttgccaaatg 63240 

gccttcttgg gataactgta aaatctataa ataaggtgat aatcagtaat atatgtcaat 63300

tagaagatgg actgcatatt gctgatatta aggcacagat agatccagag gtagatgatc 63360 

aagctgtttt ggcagaatat cctgagatat ctagcattct ttctcagctt gtaaagcatc 63420 

ctaagattag tgacctgccc atccaggttg actttggctc tgctgattca gttgcatacc 63480 

acttagcagg ccttatacct ctaagctcaa atgagaaaca aaaactatta gaagcattcg 63540 

atgcagcaca gcggatgaga attctttcag actatattga aagaatatct actacataaa 63600

ttatttattt taatattatt ggcggcttat tagattttgc tctaagccaa ttgatggact 63660 

taaaaaccgt agggatttct acaaaccttt tttcaatata gtatttgcct ggatagttac 63720

taagcattag cccaataata atagtgaaca aacctggcca ggtaacacca gcatcatgat 63780
tcccccaata ataagggaat aacctaaaat atttttaatc agtaaaacta aataccacaa 63840

caacgggtta ttggtcttaa atttcgagac atctttcttt ataaaatagt cacttggaat 63900 

taatccagca agccacctca tactaacgag actaaataaa aaaataaata aagatattga 63960 

gctcagccaa agtataagaa ctttataaga ttcaaaaaaa attatgatgg catttagggt 64020

attaaattcc ataattaccc acctttttat ttaataatga tttagtatag caatgtctaa 64080 

aaataaattt aaacattaaa taaatataca ataattttgc caaaaaaagc cttaaatgat 64140 

atttgcactg aaaccctcag tgataataaa gccgaaaatg ttctatcact tgatatcaaa 64200

ggtatttctt cttttgctga caacatcatt attgcaactg caaattcgaa taggcatgca 64260

aagtctttat ccgaaaagtt agttgaggag ataaaagcta ataaaattag tatcatgggt 64320

gtagagggca agacagaatc aggttggata ttagtagact gtggtgaggt tgtagtaaat 64380 

attatgaaga atgacataag agagttttat gatttagaag gtctttgggg tgaaaacacc 64440 

ctcatcgatt cttcgaaata aatgctatta aatatcataa gtgttggaaa ctcaccttca 64500 

tcttgggagt taacaggaat agaatattac acaaagcaaa tccctaaaga agtaagtcta 64560 

aattttatta acgtaaaagg gcagcaacac ccaaaaagat ctacagaaga ggtcttaaaa 64620 

cttgaatcta agttaatctc gtccaaaata gattctaatg gatatattgt ttgctgggac 64680 

tcaagtggcg agtcgttgaa taattttgaa tttagtaaat tttttgaaaa atccatgctg 64740 

gaaaatatga agctttactt tgtaattggt ggttcatttg gcatacctcg agatattctt 64800 

gataaatcta ataagataat ctcgatctca agtctcaccc tacctcacag gcttttcaag 64860 

attgttctta tagagcaaat ctataggtca ttttcaattc tcaaaaatct tccctaccac 64920 

aaatgattga tctgaatgag aggtttgttg aaaaaagaag ctttttcaat agattgttgc 64980 

ttatatattt tttctttggg gccctgtttt tattcttgct tttcaagacc tattctttac 65040 

aggtttctag ctattctgac tatgaactag cagcactaaa gaataaaaca aaagaagtgc 65100 

tagttcagcc tgttcgagga gtcatctatg acaggaatgg aaatattcta gttaataatg 65160 

tccctacata tgacctaata atccaaccat caaaaattaa aaacttagat gatttcatta 65220 
ttaatatttc taaaataata gagctatcag attcagaaat tgaaaacatt attaaaaatt 65280
ttaaaaggag tgcaacttat aatagagagc taacaattaa aaaagatctc tcgaaagaag 65340
aaatagcaaa atttgaagta agaagctacc aatttcagaa tgcatttata gatgttaggt 65400
atagtagaga aaataaatat ccttacttat tttcacacgc actcggttat gtaggtggtg 65460
taagtaatga taaggtttta tcaattttaa aaaatcaaga tttgaagcaa tctgaaacaa 65520
cttttaagta ttcaggtggt tttattgctg gaaaaacagg attggaaaat atttatgatt 65580
cagctttaag agggtccttt ggaaaaaagt tatttgaagt tgatgcaagg ggaaggcttt 65640
taaaagagct aagttttgaa aaacctatta atggaaaaag cctttttact catttagatt 65700
taaattcaca aaaaaaagcc tttgaacaaa tgaataatag aaggggtgct gttgttgcct 65760
tagagcttaa atctggttct atagttacct acctaagcac tccaagcttt aatgtaaatg 65820
gtctttctaa tggcatgtcg tcggtagagt tcgaaaaact gattaatgat gtagacaagc 65880
cattttttga tagggccggt caaggtcggt actctccagc ttctacaata aagccagcaa 65940
tagcattgtt tggtattaaa gaaaaaatag tagactggaa ctttacactt aaagatcctg 66000
gattttttgt attaccagag gatcagagga tttatagggg atggaagaaa ggaggtcatg 66060
gaacaattga tatgaagggt gcaatcatag aaagttctaa tacttttttc ttttctcttg 66120
cttataaggc tgatattaat aatttaatca gccatctttc tgagtttggt tttggcagga 66180
atgtttgtaa agattgtttt aatccagatt ctggattatt gcctacgcct gaatggaaaa 66240
tgaataatct taattttggc tggtttaaag gagataccgt taatttaggt gttggtcagg 66300
gctatatgag tgccactcca gttcaattag catattactc tgcatttctg gcaaaaaaag 66360
gaaatcttca agagctatca tttgtcgaga gtgacagcct aagcaatact gcttttataa 66420
ataatttaag catagataac tcagactggg atcaaattca ctcaagcatg attggagtta 66480
ttgaagatcc gagaggcact gctaaaagat taaaaccttt aaaatcatat gttgttgctg 66540
caaaatctgg aacggttgag cttgtcagca cacaaacaaa ggaagattac aaactagtaa 66600
ggcaaaatat tggtaacaga gaccatgcaa tcattgtggc ttttggtcca atgcctgatc 66660
cagagtatgc agtaagtgtt gttattgaaa atggtgaaag cggcggttct gttgcaggtc 66720
ctgttgctat tgctgtttta aatgagctta tcaataaatg aagaaaaaat tagactttaa 66780
aaactttagc atttattttg atcaatattt atttattgcg ataaccctgt tgtcggtcat 66840
gggtttattt tttttataca gcgcatcaca agaagatatc agcactgttg ctaagcaagc 66900
tgtattcgtt ggttttggtt tgctgttaat gtttgtagtt agccaacctg accctgattt 66960
ttataataca ttttctgggt tattttttgg ggggggastt gtattgattt ttctaactat 67020
gatttttggt aaagaaataa atggagccaa aagatggctt gatttaggat tttttaccct 67080
gcaatcctct gagattatta aaatatcatt gccaattttt ttatcatcat atttatataa 67140
taagccactt ccaataagca ctaggcatac ttttattaca ttgatattaa taggttttat 67200
atttgcactt gtagctagac aacctgactt gggaaccagt ttagtagtat ttatgtcagg 67260
gggttatgta ttatttttag caggattaag ccgccgttta ttggatctgc aatagcctta 67320
tttttattat ccttacctbt tttatggaat aactttctag agccatttca acaacagaga 67380
gttttaactt atttgaaccc agacgcagac ccttatggta ctgcatggaa tataactcaa 67440
tcaaaaatag caattggatc aggcggaatt aatggaaaag gttatcagga gggctcccaa 67500
gcccatcttg atdttttacc agagacagaa acagatttta tttttgctgt tattgctgaa 67560
gagtttggtt ttattggagt ctgtattttg ttatcagtat ttdtctttat atkactcaga 67620
tgtttatatt tagcatttaa tgcaagagat agattttgca ggttaactat aggaggccta 67680
agtttagttt ttgcctctac attatttatt aatttagcaa tggttgttgg tgtagttcct 67740
gtagttggta tgcctcttcc atttatcagc aaaggcggct catctttgct atcctkttat 67800
atagcttttg ggattataat ttctatggca acacataaaa aattaatgca aagatgaaaa 67860
aaattatatt tataacttta atattcacta tttcaattac tgcggattat tcgaatcatg 67920
aagatagcca aatggtaata aatgaactcg tcacaaaaca cggttttgag gaatcttatg 67980
ttactgcaat cttaaaaaat gcaaaaaagc gtgatgagat gcttaaatct gttgctaatc 68040
ctgctgaaaa aacaaaaaca tgggatgaat atagggctat ctttataaaa acaaaaagag 68100
tttcagaagg caaaaaattt ataaaaaaaa atattaatgc tttagagagg gctgagaaag 68160

aatttggagt tcctaaagaa ataattactg ctattttagg cgtagaaact aattatggca 68220 

gcaataaagg tggatacaga gttttagata gtcttactac tttaggtttt gatgacccgc 68280 

gaagatctaa tttttttaga agagaactta tagagttctt ccttttaaca agagaaaata 68340 

atttagatat caaaacaaca aaaggctcat atgcgggagc catgggatat gcacaattta 68400 

tttcgtcaag ctaccgagcc tatgctgtag attttgatga agatggttac gttgatttat 68460 

ttaattctgt tgatgatgca attgggagta ttgcaaatta tctttatgtt catggatgga 68520 

agagagaagg aaagatcgta acaaaaactt atccaaacaa tgttagaaaa ttttataaac 68580 

ctcatgagtc tctaacaagg ttcatacctt taatctttaa tgaagatgga aaagatcttt 68640 

tttttattgg tgatgataac tttagggcta ttgctaagta taatattagc gatgtctatg 68700 

caatggcggt ttattactta tcagaagagt taaaaaaatg aaaaaattat tattcacatt 68760 

actatcaact tcaatattta ttcaagcaca gagttttgtt ccagattctc ctgagttaga 68820

tctgaagagc tatatcctaa ttgagccaaa caccaatact gttattgcag aatttaattc 68880 

agatttggaa atagaaccag ctagcatgac taaaattatg actagctatg ttgtggctga 68940 

tcagattgca aatggtttaa tatctcttga tgacgaagta ctaattagtg aaaaagcatg 69000 

gagaatgcaa gggtctaaga cctttataga agctggaaaa aaagttacgg tatctgatct 69060 

tttaaaaggt attatgattc agtcaggcaa tgatgcctct gttgcaattg cagaatatgc 69120 

aggcggcact gaaagaggct tcgttgactt aatgaactct tatgctgcct ctttggagat 69180 

gaataatacg atctttcaga attcaacagg gcttccagat gaaaatcatt tttcatcagc 69240 

aaaagattta gccaatctga cagctaacta cattaataaa tttccagaag aatatgcttt 69300 

atataagcag aagcaattta cttttaacaa tattaagcaa ttaaatagaa ataagctttt 69360 

atggagagac gactcagcag atggtgttaa aactggtcat acagaagctg ctggatactg 69420 

tttagttggc tcagcaaaaa ggggcggtat gaggcttatt acagttgttg caggaagtaa 69480 

gtcagataat gatcgttttt tatcttccca aagattactt gagtatggat ttagattttt 69540 
tacaacacag aagatgttaa gtgcagaaaa agaatatcag aatattacag tttggggtgg 69600

acaagaaaag atacttggcg tgggagttct agaggatata tctattactc ttcctagaac 69660 

aagtttcaaa aatgttgaaa ctgtttataa agttaataac aatatccagg ctcctatcgt 69720 

agttggtcaa aaagtcggga ctctagaaat aattagtaat gatgagattg ttctagttac 69780 

ggatttggtt gctttaaaaa atattgaagc taaaggtttt tttggaagaa tatggtcaaa 69840 

gttcgttctt tggatattta gcttatttgg tctaacagat gaaaacacca cttaaaggcg 69900 

tttttaatgg ggcttttgat actgtagata acataaaaat atcacctttt tctcgagctt 69960 

atactttttc tgatagcgta tatgaagtag ttcctttctt taattcaact gcaattgctt 70020 

ttgatgatca tataaaaaga ttagaatttt ctgctagtca actagcgatg gccgtagatt 70080 

taaaagaagt tatatttgaa attaattctt taataacatc atctgaattt agtaatggct 70140 

atgtctatta tcaagttact cgaggcgttg atccattaag gtctcatatt catgagccaa 70200 

atttaaaaat agaaaccttt ggctatgcaa aggcgcactt attccaatgg aaaccattaa 70260 

gagtatcagt atgtgatgat attaggtggg ggaggtgtga tattaaatca acatcattgc 70320 

ttggtaatgt tatgaacatg aatgcagcaa aactagataa ttgtgatgaa gttattatgc 70380

acaaagataa tttgttgacc gagggtggcg catctaatct tttttttgta aaaaatgatt 70440

caatctgcac cccagctcta aatggaaata tacttcccgg cataaccaga gcattactaa 70500 

ttaatgaatt aaaaaattat agtatagagg tcatcgaaga taacttcagg ctagaagatt 70560 

tatcaaatgc ttcttgcgcg tggctcacaa gcgcaacgaa agggctggcc cccatttctg 70620 

aaattagtaa cctagaatct cacttggatt tataccatcc tcttttcaaa aagagtgaag 70680 

aaatttttaa taagaagttc cttagttaga agtcagttta ataactatat tatcaagctc 70740 

gtaccaaaaa tctttatcac taagaccttt atttgcaaga tcaagctcaa atacattttt 70800 

ttggagctgt attaagtttt gtagtttatg atttttaata aaatgcatat attcagaagt 70860 

tttatttttc caaaccccag atttaattag actatctaca ggatttttac tttgtctagc 70920 

attggctgca ctgttaataa ttttaccaac tatccaaact agaagaggtg cataatgatc 70980 
ttccgctgat tttattgaat ggataatttt taaagcttgc ttggtattat ttgaaataat 71040

tttatcttca agctcaaatg gcataaattc tgcagactcc acagaaggtt ttctttcttg 71100 

gccatcacca tctttataag tcagtcttag aagctttact tcattctgtt gggcaactaa 71160 

gtttccagag ttcatttcag atatattttg cactaaagag ggcctatctt tttcagaaat 71220 

aaaagatagt tgatgcttaa gccaaatctt ttcttcaaaa gatttaagtt ttccacaatc 71280 

aatgatgagg gctatctcat ccatctgttt tacccactta gttgtgaaat taagcttttc 71340 

attgcttgaa ttaataatta ttgcaatatt atccatatgg ttaatatttt caatttgaga 71400 

tattttaatt atttggtcag gtatttttcc ttgatcatga ttaatctcaa ctattatttt 71460 

tgagccaaat aaagatccag aagcattttc aataatagtc tgatctattt tatcaaaccc 71520 

P^tctttatta ataattgttt tttctttgaa ccctttattt gataaatgtt tgagcaaaag 71580 

gfitctttagag ttatttttta agacaacctc agatccaaaa ataaagaaga tattttgtga 71640 

ctgatctaaa tatttttgag ttgtcattgc ctcacatttc aaaagcatgc acctcaaaaa 71700

ttatttgatc aattaattcc ttctgcattt cagactcaag agatttaatc atttgatctt 71760

gtgcaaatgg attaagttca ttcgatttat atctcttgta gctaacaagc tttttactta 71820

tttgtttatt tccaatcatt tttatatgta tttctaaatt aatttctcct tcaagagctc 71880

ttagagaaga gcctccatag atatcatatt tatttaaaat atagtcttgt attagaatct 71940

gattttgacg attctcagct tctttttctg tattgaagag gattgctaac ttttgctcaa 72000

aagaattagg gacactacta tcaaaccgaa aattaaatag atctttttta tctagatcta 72060 

tatttagttg attaaattgg cagccagtta aagatatcaa caagcaagtg tataaaaaga 72120 

aaggtcttag tttcataacc tcatagtatc tatctgaaga atatatatca aagattaatt 72180 

ttaaataaca aaatttatta ttttttcttt aatataaata gtttttttaa tagccgagtt 72240 

gtttattgtt gatgccacat tgtcgatagc cagtgctaaa gcttctatat ctttttgctc 72300 

aaggtttttg tctatcatta ccttacctct cacctttcca tttacttgaa ctattaaatt 72360 

aaattcagaa acttctaata actcctctct aagaacaggc caagaagatt caatttcttc 72420

ttgtgcaaag tcaaagtaaa aattattcca taaatgttgt gagatatgag gcgcaatagg 72480

attgagagtt ttaagaataa ttattattgc ttcattaaga caatattgat ttgagatcga 72540 

cgcattacct tctttaaaag aatcaggtat aaagttaact agctccatta tcgaagcaat 72600

ggctgtatta aatgaatatc ttgtttcaaa gtcatttgta acctttttaa gagtattatg 72660 

cgattttctt cggagctcta cttcttgttt cgaaggatcc ttgggctcct ctaagtcaat 72720 

aaattttcta ttgctgacta agttccatat ttttttcata aatcttgagg caccttctac 72780 

agatgactct gaccattcta gactttgctc agggggagct gtaaacatca tgtaaagcct 72840

tacagtgtca gcgccatact tttctatata agattgagga tctacagtat ttccttttga 72900 

tttggacatt ttagccccat ctttaagaac catgccttgt gtaagaagct ttttaaaagg 72960 

Stcattgcct tcaactaaat ccatatctct aagcgcctta tgaaaaaatc ttgaatatag 73020 

flaagtgcagt atcgcatgct caatacctcc tatatataaa tcaacaggca gccaatattt 73080

tgaattttta tcaaacattt catctgcatt gtcagatgaa gtaaatcttg cgtaatacca 73140

tgatgagtcc ataaatgtat caaaagtatc tgtctctctt tttagtcgat cagaaatatt 73200

ataaaaatct tcattttgac ttaagggaat aggtgccgag ttctttttta gctctggaag 73260

HJactataggc atatcctttt catcaataac tcttggttca ccattttcat aaacaaccgg 73320 

taatagggcaa ccccaatatc tttgtctgct cacaccccag tctcttaatc tgaactggat 73380 

taactgctca ccaaggtttg cgtctgctaa atctttaata atttcaagag atgcctcatc 73440

agagtccatg ccatcatatt tatcagagtt tattaatttg cctttttgta caactggaag 73500 

ttcattatta ttatcaaaac taataacttg aggaatctct agattatatt tagatgcaaa 73560

ttcgaaatct ctttgatcat gagcaggaac acccatcact actccagttc cataatcaag 73620 

aagtacaaaa tttccaatcc atacgggtat cttcttcttt gtaattggat gaattacatg 73680 

cattccacta ttaattccaa gtttttcagc tttagccata tcagcttcag cagctttcac 73740 

ttctttgcat ttgtttagaa agtcttttat cgattcatta ttttttgaca accctattga 73800 

tatagagtga tttggtgata ttgctaaaaa agaaacacca aaaattgtat caggtcgtgt 73860

tgaaaatacc tttaaagaat catcggaatc ctcaatcatg tatttaattt ctgctccctg 73920

agatttccca atccaattcc tttgcattgt tttaacattt tcaggccaat ccacctcatc 73980

caaagatgtt aaaagttcct cagcgtagtc tgtaattttt ataaaccatt gatcaatttc 74040

ttttatttca acttgcgccc cagacctcca tccttttcca tctataactt gctcatttgc 74100

taaaactgtc tcatcaactg gatcccagtt cactaatgat tttttcctat aaaccaaacc 74160

tttatcataa aactttttaa atattaactg ctcccactta tagtattccg gttcacatgt 74220

tctaagctct tttgaccaat catacccaag gcccaaagat ataagttgct gtttcatatg 74280

ttcaatattt tgattcgtcc aatcttttgg gctaacttta ttagcaatcg ctgcgttttc 74340

agcaggaagc ccaaatgcat cccagcccat aggctgaaaa acattaaagt tattcattct 74400

tttatatcta gatatgacat caccgatcgt atagtttctt acatggccca tatgtaattt 74460

acctgatgga tatggaaaca ttgataggca ataaaatttt tctctaccat caggattagc 74520

tttaaatttg tcttctttta accattcatc ttgaatggtt ttctctattt gacttgggtt 74580

atattcagga ttcatttttt taggaactca ttttctaagt aattaatagt atgtttattt 74640

tctttaaagg tatcatcatg cagaatcctt tgatgaagag ggtgatttgt tttaatgccc 74700

tctataaaaa attcatctaa agcactcagc attctcttaa tagccgagct tctagaattt 74760

gcctgagtaa tgatttttgc tagaagtgaa tcatagttag ggggaactct atatcctcca 74820

taaatatgtg agtcatatct tatgccaaaa ccgccaggtg tatgcatttt tgtaattgtt 74880

cctggggatg gttgaaaatt atcaggatct tctgcattaa ttctgcattc tagtgagtgg 74940

ccatgaaaat taatatcatc ttgatttagc tcaattggca tttcaagagc aattcttagt 75000

tgtgctttta ctaaatcaaa gccagttatc atttctgtta ccggatgttc aacctgaatt 75060

cttgtattca tttctataaa atagaattga ttatcttcgt ataaaaattc aatagtgcca 75120

accccttcgt aatttatttc ttcacataaa ttgacacaag ctattagtgt tttattaaga 75180

gcttcttgat ttatattaag tgctggagct tcctcaatta ttttttgatg tctcctctgc 75240

atactgcaat cccttgtacc taaatgtatc gcctttcctt taccatcacc aactatttga 75300

acttcaatat gtctagggtt cccaataaat ttttctagat aaatggtttc attgccgaat 75360

ccatttttag cttcctgcat tgttatttct gcatgtccaa taagatcttc ctctctttca 75420 

acaactctca ttcctcttcc accgccacca gccgttgcct taatcataat tggatagccg 75480 

atatctctag caattttttt aaactcatca ttatcagatg gaatctcatc tttataacct 75540 

ggaacgattt gaattccaga tttttcagcc agtgtttttg ctgttatttt gtcgcccatt 75600 

ttttggattg tttctgaggt tggcccaata aatttaaatc cacttttttc acacatttct 75660 

gcaaaattat gatcttctgc aaggaaccca tatccaggat atatagcatt agcacttgta 75720 

agttctgcag ctgagagaat ggcaggtata tttaggtagc tttgagttgg agatgcaggc 75780 

ccaatacaaa cagtttcatc agagaatctt aaatgtttta gatctttatc agcttctgag 75840 

taaacagata ctgtcttgat accaagctct ttacaagctc ttatagctcg aagagcaatc 75900 

tcaccacgat tggcaataag aactttataa ctcattttaa ttgacagtta taattttttg 75960

tccaaattca accggacttc catcttccac gtcaatactt gaaatcttgc catcaaattc 76020

tgatttaatt tcattcatca ttttcatggc ttcaactata cagagcacat cacctacttt 76080

tatattgtct cctactttta caaatggatc tttttcggga cttgggcttc tataaaaagt 76140

tcctactatt ggagaggtaa caacatcccc tatgacagtt tgcttagcct catccaattg 76200

atgagttgtt ggtgagatag ctggctcatt tttgacaatg attggttttg aaacaaattc 76260

ttgattagag ccgttatctc ttgaaattct tactgactca tctccttgac ttacttctat 76320

ttcttttaaa tcagactctt gaagcatttc tataagtttt tttattttcc taatatccat 76380 

ttatttaccc ctgtatttta atatgccctt aagcatcgcc tcttcataac ccttagcacc 76440 

aataccagtt atcacttctt cagctatatc agatagatat gagacatgtc taaattcctc 76500 

tcttgtatat atgtctgaaa ggtgaacctc atagaatggt atgttgacac caagaaatgc 76560

atcccttata gctatgctcg tatgagtata tgcagcagga tttattatta caaagtctat 76620

tttttcttca atagtgctat gtattgattc tattatttca tgctccgcat tactttgaaa 76680

agatattaaa ttgcattgat tttgatttgc taagactaat aaatcttgtt gaatatcttc 76740

aagtgatttg ctgccatata cttccttttc tctagttccg agaagattga ggttggggcc 76800

atttataagc aatatattca tataatagat tttaacagaa atttacagat tttttaggat 76860

tttaaagata tttttataag agttgatttt tttgcatggg atagcaaaaa cccgcttcag 76920

agcatccctg atagtatatt aaaacctccg ataagttcaa tgaagaattt atttcaatgc 76980 

ttataataaa ttcatctcta agaatctcag tctcgccaaa aaattcatcc ctatatagat 77040 

ctttacttga ctcaagcgtc ttaaatggga gcactttatc tttaaatttg agctctatac 77100 

tatccaagta catataataa ccatctctta tttgccaagt tatatatgct gcttgttttt 77160 

caacattggt tgttaaagca aatacttcat tggcttctgg gaccctattt gagttttcaa 77220 

aaagatttgc tgaattcaat tcacctgcaa tcacattgga tagtagtata attattaaac 77280 

ttttgatcat gaagacttaa gaataaaccc ctaagtactt aaaataaaga ttttattaaa 77340 

taacacggta gttaaaaatt tatgttccag actttcgcac taaaaatttt cttttatatg 77400 

cctatttggc ttttaaaaat aattttttat agaaaaagaa cagtaataag agggcatcaa 77460 

tttgatgctc aatctgctgc attattgtct ttgcttccaa agaaagattt atctgaatta 77520

ttagatggtg aaattgcaaa agctagaatt actcttgaag aagcaagaat tcaaaataaa 77580

gtttctttaa caccgtctat acaagtcaga aaagtagatc atattttacc aaagcatgat 77640

ttaattctca gagaatacaa gccgcatcaa gaggatttaa aaaaagttat tctctatttt 77700

catggaggag gttatgtcct taactcagtc aatacacatg atgatatggt ttcatatatg 77760

tcagataaaa taggagtaaa gttttattct ctagactata gactatcgcc tgagagcaaa 77820 

tatcctgatt ctcttgatga tgctctagat gctttttctt ggctcatcgg ccagggatat 77880 

ggaccaggtg atatttcagt ttgtggagac agcgcaggcg ctcatctagc tgcttctttg 77940 

tgccattacc tagctgaaca aaataaagat ttgcctagta gccagctact aatatatcca 78000 

atgtgtgatc cgtcctgttc atctgagtct tataatttat tatcttcagg atatcttcta 78060 

actaaaaaaa ctatgatttg gttttgggat aagcttagaa cttccgaagt aaacaataac 78120 

gattcggcct ttaatctctt aaaatttaat tttgaaaaaa ctttaccgcc aacaattatc 78180 






gttacaggtg gctttgatcc tctatgcgat gatggagaaa aatatgcata tttattacat 78240

aaaggtaaac ataatgtgaa acaattacat tatccaacaa tgtttcatgg gtttgcatca 78300

atgactaaat tgaaagcagc gcagatagcc gttgaagatt ttttaaaaga atataagaaa 78360

atactatgag taaaatttta gaagtaagtg acttaagcat caatttttca acaagggatg 78420

gattgtttaa tgcagtggat aatataagtt ttgatataga aaaaaatcaa accttggcct 78480

tggttggtga gtcaggttct ggtaagtcgg taactgctat gtcaattctt cagctccttc 78540

aaaaaccaca agcatcatat tccaaagagt cttctattaa gtttaatggc gatgagataa 78600

taaatgccaa gtatgaaaag ttactttcct tgagaggaaa tattatatct atgatatttc 78660

aagagccgat gacctcacta aacccttatc acagagtagg taatcagata actgaatcaa 78720

tactacttca ctcaaaaagc tcaaaaaaag atgcaataga tgaagcaaaa aaattaatgg 78780

cacttgttga gattgatgat gttgaaagac ggttctatgc ataccctcat gagctttctg 78840

gagggcagcg acaaagagtt atgattgcta tggcccttgt taataaacct gagctattga 78900

ttgctgacga gccaacaacc gctcttgatg taactatcca agcccagata ttagatctca 78960

tgtctaagct taaaaatgaa ctaggcatgt caatactttt tattactcat gatctaggcc 79020

tcgttcaaga attttctgac aatgtttgtg ttatgaagaa tggcaagata gttgaacaag 79080

jjaaatactgt tgaggtattt aacaacccct ctcacgaata tacaaaaaaa cttttagatg 79140

iagaacctca gcccaaatta gataaccctg taagtgatga gccaataatt gagatcaatg 79200

acttaaatgt ttactactca ataccttcaa ctaatttttt taaaaagaat acttttcatg 79260

ctgttaaaaa tacttccttt agtatttata aaaatactac aattggcctg gttggggaat 79320

ctggatctgg aaagtcaact ttgggtaagg ctatagcaaa cttagtttct tataagggga 79380

atattaaatt tgagggaaga gatatcaact caaattccca aaaagaaaat aaagaattga 79440

aaaaaaatgt ccaaattgtt tttcaagatc cttatgggtc attatcacca cgaatgacag 79500

tgggggagat agttggtgaa ggtttaggtg ttcactttaa gcttacaaaa aaagaaagag 79560

acgaaaggat agataaggtt ctgtcagatg tcggtatcga aatagtagct aagaataaat 79620

atccgcatga gttttctgga ggccaaaggc agagaattgc aattgctaga tctttaataa 79680

tgaatcctgc ttttatgatt cttgatgagc caacatcagc attagatagg tctattcaaa 79740 

ttcaggtaat cgatttattg aaagagatac aaaatgaata tgggcttact tatcttttta 79800 

taagtcatga tttaaaggtt attagatcga tgtcagactt tatttttgtt atgaaaaatg 79860 

gagaaatcgt agagtcagga ccttctcaca aggtctttga aagcccagag caagactata 79920 

ctaaaaaatt actatcagct gctttaaagt atgcatctga ataattaaat atatymciritm 79980 

tggcaaatag aaartattca aaagagctcg ttgacggtcc taatcaagct gcttctagat 80040 

caatgcttag aggagtaggt ttcacatctg aggmtttcac aaaaccattt gttgggattg 80100 

cttccacagg agcaaaagta accccatgca atatgcacat aaatgcactt tcagagatcg 80160 

ttgagaaatc agttgatagt tcaggaggaa agggtgttct ttttaatact attactgttt 80220

ccgatggaat ttctatgggt acacagggta tgaaatattc tcttgtttct cgagaggtaa 80280 

ttgcagattc aatagaaact gttgtgggat gtcttggtta tgatggagtt attgctgtcg 80340

gtggttgtga taaaaatatg cctggatgca ttattggaat ggcaagatta aatagaccat 80400

caatatttat atatggtggt tctatcaaac ctagtaaaga aaataccgac tatgttactg 80460

tttgtgagaa aactggagag tactcaaaag gcgatcttaa agaatctgaa ttaattcatg 80520

tagaaaaaat ttccgtaaaa gggcctggat cttgtggggg aatgtatacg gcaaatacta 80580

Tbggcttctgc gattgaagct ctaggcatga gtcttcctgg aagcagcagt caagatgcaa 80640 

tttcacacga caaagaagat gattgtttta aggctggcga agcgataatg aatttattag 80700 

aaaaagatat taagccttca gatattatga ctaagaatgc ttttgagaat gctataacaa 80760 

tggtaattgc tctaggaggt tcaactaatg cagttctgca tttattggcc atggcgcatt 80820

caataggggt tgatttagag ctagatgact ttacaagaat aggaaaaaaa acacccgtta 80880 

tggcagatct taagcccttt ggttctcatt atatgtctga actcaatgct aatggcggta 80940 

ttcagccact aatgaaaact ttgcttgaga agggattact acacggcaat tgtcttaccg 81000 

ttaccggtca gacgcttgct gaaaatcttt ctggaataaa accttacgag cctgatcaag 81060

agataattaa atcatttgat aatccaatta aatcaaatag ccatcttaga attctgtatg 81120

gcaacttagc gaaagatggt gcagttgcaa aaattacggg taaagaagga acttcctttg 81180 

aaggaagtgc tcgtgtattt gattcagaag aagaaggggt taaagcaatc ctatctaaat 81240 

ctataaaagc tggagatgtt gttgtaatta gatatgaagg gccaaaagga ggtccaggca 81300 

tgagagaaat gctaaaacca acatctgcca taatgggtca aggtcttggc gatcaggtag 81360

cttttataac agacggtcgt ttttcaggag gcactcatgg atttgttgtt gggcacatta 81420 

ctccagaggc tgcagatggc ggcttaattg caataattaa agatggcgac tctatattaa 81480 

ttgatgcaga ttctgataag ttaattctta atatttctga ggatgaaatt tcaaatagac 81540 

taagtagatg ggtaaaccca aaaacgcctc ccaaaaaagg agtcttagca aaatttgcaa 81600 

aaagtgttaa atcagctagt cttggagcgg taacagatta aatatgtatt ttaaaagaaa 81660 

atttccaaat agtagactaa gaagaatgcg tctcaattca aacctcagag acttgcttgc 81720

tgaagttagg ttgtctacaa atgatctaat tcagccatta tttataaaag aaggcttgag 81780

tggaaaagag gctattgaaa gcatgcccaa tattaataga tatggacaag attcaatttt 81840

ttcagaaata gaagagctac tagagcatga tataaatacg attgccctat ttccagttat 81900

taatgattcc aagaaaaaca gcactggaga tgaggctatc aatgcatcta atttgatgtg 81960

igaaactatc agtaacataa agaaacgatt ccctgaaata atcttaatat ctgatgttgc 82020

tctagaccca tataccgatc atgggcatga cggtatttta aaaaatgatt atgttgataa 82080

tgatgagact ttagctgttt taaggaagca atcactaaca ttagctcagg ctgggacaga 82140 

cataattgca ccatcagata tgatggatgg aagaataggc tctataagag aggcattgga 82200 

cgaaatggat tataaaaata caatcctact atcatatgca gcaaaatata attcaaagtt 82260 

ttatggtcca tttagagatg ctgttaattc agcttcaagt ctaggcaagt cttcaaaatc 82320 

cagttatcaa atgtcaccaa aaaatataaa tgaagctcta catgaagttg ctatggatat 82380 

taatgaaggt gcagacatag tcatggttaa gccaggtatg ccttatctag atataatttc 82440 

aaaagtaaaa gaaaccttca aagtacctac ctttgcatac caggttagtg gtgaatatag 82500

aagatattga gtggtggagt tgactccaat gctcttgaaa ggccaaaaag gttttttggt 84000

gctgctagaa atcttgaaga gggtggaagt ctcactattc ttgctactgc tctagttgaa 84060

acaggctcaa agatggatga agttatttat gaggaattca aaggtacggg taatatggag 84120

attcaccttg aaagaaaaat agccgaaaaa agaatatatc ctgctattaa tattagaaga 84180

tcgggaacaa gaagagaaga tttacttact gctgaagatg aattacaaag aatgtgggtc 84240

ttaaggaaaa ttttagacga tatggaagat gctcagtcaa ttcagttcct aatagataga 84300

ttgaaatctc ataaaacaaa cgatgagttc tttacttcaa tgaaaggggg taatggcaag 84360

aagactagat aaagtttttt gccatatcaa tcatcatctt atcagttggg ctttgcgata 84420

cgtgaatttc taattcttta aactcatttt gacatctatc ctttatattt tctgaagcaa 84480

caataaatat tttttttctt aaaactgcat cgtccagatt tgttattaag aatttaagag 84540

tactaaaatt atagattaaa aaaatttcat tattatctgt aacttttggt atttgctcca 84600

gcaaatagat tacttcatag caaactatct catctagact agccttaagt ttttgttgaa 84660

gaaaaccatt tgaattttcg ccgcaaaata aaagactctt tcctagaaaa tttttctcta 84720

ttaacttaag aattccttca gatgaatggc tttgtggaaa atgagatttt atgccacttt 84780

caagaagttt attggaagtt gcgggcccaa cagacaaaaa attgtgtgga agatcatcta 84840

aatcgaaaaa agattttaaa atatcaagtc cgtacgaagc agcagcttgg ctagtaaaaa 84900

ttaagtttga atatgaatga atatttttaa ttttatcgat agcgattttt gaagggtcga 84960

ttgaattaat tttagaaaga taaatatttt taagagctat ttcttgagct tcacataaag 85020

atattaggtt tcctgataaa tttttgggcc tcgtattaat tatcattttt taaaattaaa 85080

gatttcgctc cttccaaaat aaactcatca gcaaataaca taatatctat tgcaatatct 85140

tgaaaggaag atatcttttc tttataaatt ttttcgttgc cttcataaga taatattttc 85200

cctgatattt ttatttggcc atttttatct tcacataaaa tagctattgg agacaggcaa 85260

gtaccttcca tggcagcaac aaaagatcgc tctgcactgg caagaattaa ttctttaggg 85320

tcaccaatat tttctaaaag ctcaataata tcttttttat ctgacagaca ttcaattgct 85380

atatatccct gagatgcaga aggtaacatt tcttcaattg agaattcata tgaattttgt 85440

tttaaaccaa gcctttttat agcagcctta gcaactacca gcccatcaaa cagaccatca 85500 

ttgagtttct ttattctagt agctatgttg cctcttactg gaatagtttt tatatcaggt 85560 

cttaagttat taatttgtgc ttttcttctt ggaccagaag taccaatagt tgaatttcta 85620 

gcaaattcag aaaatgattt cccatcctta gatagaagca ggtcttctgg agattcccta 85680 

ctaaaaacac ttattatttt aaactttgga tcaagctttg ctggaacatc ttttaaacta 85740 

tgcactgcaa tatctgcttc atcagcctct agagaggact caagtgtaga aatgaatagg 85800 

ccctttccac caatttcatg aagaggttta tctgtttggt ctccttctga tgtcatagga 85860 

acaagctcaa ccttaatatt attgatctta gctaataact catctgcaac aaattttgcc 85920 

tgatacattg ctagttctga ttgccttgta gctattctta ttttcatttt gtttccaata 85980

aaagagcctt cacatcgcca acactctttt ctttcaagat agtgaattcc tcaggaattt 86040

ccaaggtagt aaatttacta tactcaaggt agattttaca agatggcttt atttcattct 86100

ttcgaattat agattttaga acttttaatt catattcttc gccaaaagga ggatcaagaa 86160

gaattaaatc aaaacgagat aagtcatttt ttttaatcca actaaatgca tccttaaaaa 86220

aaactttaga tttatccttg atgccaagaa gttgaatatt ttttgctaac acagaatagt 86280

tttttttatt aagttcaacg aacacaactt tttttgactg tctggatatt gcttcaatac 86340

caagcgcacc cgttccggca aacaaatcaa gacatattaa attttcaatc tcaaattgaa 86400

gccaattaaa aattatttct tttaatttat ttgaagttgg tcttaaggaa tccttaaatt 86460 

cgaaaggtat ttttttacct tttaaataac ctccagtgat tctgatattg tttttcattt 86520

tttttgcaaa atgggtatgc ctatttatta aataaatata attamaaata catttaaact 86580 

aagttataat tcatccatgm gtcctacaag ataattttag aaaagcaatg aggagctata 86640 

tttactctgt cagtgtgatg tcaaatgttg acgagaataa aaagtttagt gctataacag 86700 

tttcttcagt tacctctgtt tctttagatc ctcctagttt gctcgtttgt atcaataaat 86760 

cagctggaat tcacaactca ataaaagaag ggtcctcttt ttgcataaac cttttaaata 86820 






aaaatcaaga agatatttct aatctatgca gttcatttaa gaccgaaggt gatagattta 86880 

atagtggtga ctgggattta agcggcactc cgtttttaaa aagtgcccag gctaatattt 86940 

tttgtactgt tgatcaatta atttcatacc acacacatac tattgtgatt ggtcatgtca 87000 

caaactctct tagcgatgag aaaattaata cactgacata tgttgatggt agctatgaat 87060 

aaattttcaa aaaatgtatt ttttatttta ataatactca actctttttt tcttgcctct 87120 

aacatttttg ctagccagga agagtgtgaa gaaaagccaa gtgtttttat tatctctcct 87180 

caagacggtt ttatctcaga atctaataat gtaaaagtct tatttggatc aaaaaatatt 87240 

gaaataaatc cagctggcaa aggtgagatt gcaaaaaata aatgttttgc aagcgggcat 87300 

caccatcttt tagttaatat cgaagcattg ccagagagct ttattccttt tgacaagggt 87360

tatttacatt ttggaggagg tcaaactgaa acaattcttg atcttgatcc tggaacctat 87420 

tctctccaac taattcttgg atcttatgtg cataattcaa aaatgcaggt aaataacttt 87480

aaaggtcaag gacccttttt atcagaaaaa ataacaatta cagttaatta gagattagac 87540

cagttataac tttatctaga tgttcatcta agttattagc aattatattt gcactagaaa 87600

taggaccaga attatcgtca taatcatcac caactacatt cacctttctt attatcccaa 87660

caagaccatg caacatagac caaagagtta tacatttaaa ggcaatcact tcctcaggct 87720

-bgtctgctag attagcaaaa ctttttctca tattatcgta tgttccattt gcagatttaa 87780 

jaagatctgg gtaatcggca aagttaccaa cagctgttcc aaacatcaaa tcatatgtat 87840 

gtgcattttt taaaccaaac cctatatatt tgcttgcatt agtaacaagt tttttctttg 87900 

taattttttt tggattctca aaaaaaacaa gctcattaag ttttttaaag ccaacggttg 87960 

caacagcggc atatacacat tcttttgttt caaaatgcct ataaggagca gtttgagaaa 88020 

caccactttc ctttgcaagg gatctaatac ttagtttagt gtaaccatct ctatcacaaa 88080 

gcctgcatgc gcattctata agttcttcct ttaagtttcc gtgatgatag ttattcataa 88140 

tttaatttta atgttgacac tgctaacata tatcatgtta ttatgtatac accgcataca 88200 

ttaagttgat tacatattaa agtcaaatac agatatatga acataacaaa aataaataac 88260 






accgtcatag ctcttttatt aggcaatgcg tttctatcaa atcttgaggc tttagaagtt 88320 

cttgaggtta aaatgcttga tgaatatgct gtaactagag aatttcctgg aaagctcatt 88380 

ccatcagacc agtctaagct agcatttgaa atacctggaa agataaactc tattaatgtt 88440 

gatatcggag atgaggtcat cttaggggat gaacttgcct cattggatga tagagaagct 88500 

ttagcacaat taaatcaatc aaaggctaaa tttgatttag ctgaacaagt actagcaaga 88560 

tatataaatc tcagagcaga tgggcatatt tctattcaag atcttgataa ggctgaatct 88620 

gatcaaatag tagctaagtc gcagtatgat ttttatagag ttaaatttga gcaaactaag 88680 

ttactagctc cttttaatgg agttattcaa aatagatttc tagatacagg atcagtaatc 88740 

aatgcaggtg tccaagtttt agaaatttta ggctctagca atgttgaagc aagaatttct 88800 

attccaatga actatatgaa caaggttaag attggagatg agtatgaatt tgatatcaga 88860 

ggaatatcta caaaagctac gttagagaga ctggctccca tgtcaaccgg aggctccaat 88920

aataggttag caatttttag atttgatacc ttttttaatc caggatcaat agtaaagctc 88980

aaactaagca tcactgagaa agcaaaaggt acatgggttc caattaagtc actgtcccag 89040

tccgaacaag gtatttgggc catatatacc attaacgagc aacaagtagt tgttagagat 89100

cttgttgatg ttatttattt tgaagacgaa tatgcttttg tcagcggaac acttaataat 89160

ggtgatttag taattttagg cggagctcaa aaaattattg aaggaaaaat aataaaataa 89220

-katgaatgtt attaattttc taatagaaaa gcctaggata ttatttctaa ctttagcatt 89280 

tatattactt tctggaattt cttctgggct ttcagttcca attcaagaaa accctgaact 89340 

ggctgagaga tggggaggtg ttcgtatttt tcttcccggg gcatccccag aaagaattga 89400 

aacagagata gtaaatgatc tagaaatcaa acttagagaa gttgaagaaa tcgatgagct 89460 

tgaatcaatt attactcaag gtttttcaac aattgtagtt gaattaaatc aaagtgtacc 89520 

tcctatactt attgaagaga cttggtccaa ggttcaagac aagctcaatc aaatagttat 89580 

tcctcaaggt gcagaaatat ttcttgatag aagcagtggt ccgcctatca ctgttcagta 89640 

tgctgtaacc tggaacggca gtggagatgc tccactaata atgatgtcca gactagcaag 89700

ccagctaaaa agaaaattaa gctcaatagg ctcatctcat caaactgcaa tttttggtga 89760

aacagatgaa gaaattttaa ttgaactaga ttcatcaaag ctatcttcgc ttggattatc 89820

atttcaagat atcgcaagtg ctattcaagc cctagatgca aaaaaaccta ttggtgtatc 89880 

ctcaaacaac aattctgagc ttttatatag actcaaagat aatatacaga gcattcaaaa 89940 

actctcagaa atacctatca aggttattaa taaatcagag atcatacagc tagatgatgt 90000 

ggcatttatt tcaaaaatcc cggtttctcc tattgaagac atattcttgt ttaatggaaa 90060 

tgtagttatt tctgttgctg gaaccggatc attttctcaa agagtccatg attatgtaga 90120 

acgcgcaaca attgttgtag atgagatgag agaaactctg ccgactgaga tcactataga 90180 

tttagtttat gacgaatctg cttacacaac taaaaaattt aatgagcttg taaaaagttt 90240 

ttcattagca atattttttg ttttagcttt aagtcttttt tttcttggaa ttagatcagc 90300 

aataattgtt actcttatcc tgccattttc tatttgcctg gttatgattg gttgtaggtt 90360

tataggctta cccttgcata tgacatctat cactggaatt attattgcac taggattgct 90420

tatagataat gggattattg ttgttgaaga ttataagaat agaagagcat ctggattaaa 90480

tatcaatgat tcaatttcac aaggactaaa aaacttatgg gctccattag ctgctgcaac 90540

agcaacaacc gtcttctctt ttcttcctat tgttactgga gaagggtcga gtattgaatt 90600

cgtaggcggc atggcaatga cagtcattat gtctataaca tcttcattag ttttggcgtt 90660

attaatggtt ccagttctga tgagttatat ggaaaaaatt ccgttcttta aggatgtgga 90720

tattagcaag gaagggtata gaaatgaaaa aatccttaat aaatataggg cctttttaaa 90780 

ctgggcgttc ttagttccta gaagagcaat catgatatcg cttgcattgc ctgttctagg 90840 

attctttctt tttaattctt tacctaaaga tttctttcct gctcaagata gagatatgtt 90900 

tagagttaat atagaactgc cttctaacgc ctcatcactt actacaatgc agagagttaa 90960 

ggaaattaga gaagatattc tagatagtga tttaatttca atagaaaaag attattcgtt 91020 

tatcggcaga atgatgccta gagttttgat gaatgttgtt ggtggagaag aaaaacaagg 91080 

atccaataat attgcgcagt ctgtattttt tgctactgat tattatgaaa tgattgaaaa 91140

ccttccagat ttatcaagaa gactggttaa aaataaccct gacattatag ttaatattga 91200

tagtttctcg tctggccccc cggttttttc agatgttagt tatgtaattt ttggagatga 91260 

tccagattta ctaaaatcac ttggtgagga gctagagcta attattaaca attctcctga 91320 

tgtgagtctt acgaaatctg caacttcaaa ctcaataacc aatgttgagt ttgaacttaa 91380 

cagctcaaat atttcactgt ctggtcaaaa tgccaattat cttgtaaatg aaatgtttac 91440 

tgcaaacaat ggaatatttg ttggcactat gttggattca aacaaagaaa taccagtcag 91500 

gctgaaaggg ctgtctaata aaaacaatat tacgggaaat actagtttta taacaatgcc 91560 

ctctcaaggt ggttttgagt attttgatag tttcggaaaa agctcactaa caaacaaatc 91620 

gtcaacaatt actaggcttg atggccaaag aacaaatgat gttgagggct ggatttggac 91680 

aggtacgctc ccgtctgcta ctgaaaaatc tattaaaaaa gatgttaaag attttgaatt 91740 

aagattgcca ataggctatt cattaaaaca acttggcgag gctgaaagca ggggccaatc 91800

tcaagcctca ttatactctt cagcttttat gtatttcatt cttataatag taggcttggt 91860

tatggcgctt aattctttca gagaggctgg tctaattttg tctgttgcat tcttatcaat 91920

tggactatcc tttcttggtt tatttatagg ccagcaaaat tatggattta ttggaactat 91980

aagtgcaata gggttaattg gcttatcaat aaatgattca attattgtct tatctcacat 92040

%aaagaagag gctgagaaga aatcactaac caaagctgag cttgttgaag ttgttatcag 92100 

^tjtctacacgt catataatca ctacctcttt gacaacactt ggtggttttc ttccacttat 92160 

ttttgcaagt gtattcttca aaccgcttgc ttgggcaatg agcattggag tattaggcgc 92220 

gactattaca gccttattat atattcctgc aatgtttatg ataatgagaa aggttaagta 92280 

ctagaacaac tttccgagca ataatttctt cctagctttt ttacaataag tgattcatga 92340 

gtataggtct cgcaagaatc acatttaatc atagagttat caatcttttt tttatgaggt 92400 

cctgatatga gagatctgaa aaagaatacc gcagttatta tcaatataaa tatgaccaat 92460 

ggaatcaaaa gtaatatcga ttttaataag aacatttttt agtcactaga tgctggctta 92520 

ctatttctta taaaattcca agaccagcca gacttatctt cagatttttt tgattgctca 92580

tcttggtagt taatagaaat tattttttta gtatcttcaa gcaattcaat atatcccaaa 92640

gattcatatg aagcctctaa gatcttaaga gctctataat tttcactaga gttaggaata 92700 

ttttcaatta cataatttgc tcttcttatt gctgctatat gcgcatcaac actaacgtaa 92760 

taatcagctg ccgcaagctc atttcttgca atcatatttc ttaaatagat atttctttgt 92820 

ttagcatatg tagaatattg gctatcagga aatcttgtta agaattcagt tagttctgaa 92880 

aatgattcct tagctcctga gatatctcta tttgatagat cagtatctgt cattcttaca 92940 

ataaagctat tatctctcgt atagctagaa aggcctttca taaaatatgc ataatcaata 93000

tttggatgcc taggatgaag tcttataaat ttttctgctg cagcatgcgc agcctcagtt 93060

tcagcattca taaattgagc ataaataagc tctacttgcg cctgttcagc atatttgcca 93120 

aaaggatatc ttgattcaat tgcttctaat gaatcaatag caccaaaata attttttcca 93180 

gccatccttc tttgggcttg atcgtaataa attttttcag gctgttctat ttctgggcca 93240

tcagaattac aactaaccaa taacagagtt acaattggca atactataaa tagttttaaa 93300

attagtttat ctttcattat ttgcacattc tacctgcatt atcgtaattt aagcttgaaa 93360

acatctattt agattaattt ttttataata agttcatatg attgttaaaa atgttccaaa 93420

tgatctatct tcaatgaggc tagacaaagc tacagcagaa atgtttacag attattccag 93480

aactcagata aaaaaatgga tagaagaagg cagagttctt ataaatggag aggtatccca 93540

^ccaagagat aaagtttatg agaatgatca gattgaatta agccctaaag aagaacaaaa 93600 

agtatcatgg gaagctcaag atatagattt tgaaattcac tttgaaaatg aagattttat 93660 

tataattaat aaacctgctg gtttaataat gcatccaggt tctggttgct atgatggaac 93720

tctcgcaaat gggctcattt ataaatttcc agaattgatc aatattccaa ggtcgggaat 93780 

tgttcatcga ttagacaaag atacttctgg cattctgctc gtagcaagaa atgagtcttt 93840 

taggaacttt tttattaatg aaatgcagga gaggagagtt gtaaaaaaat atacgtctat 93900 

tgttattggt tctacactag gaagcttttc tatagaagag ccaattggaa gagataaaaa 93960 

taatagaacc aaaatggcaa ttcgagaaga tggcaaagat gctttaacat ttgtaaagct 94020

taaaagaaat attggaaact actctgtgtt ggatataaga atagagacag gaagaactca 94080

tcagattagg gtccatctat catcaaaaaa actaccaata attggagata aaacctatga 94140

cccaagcagg tctattgcaa gagatacccc tgaagagcta attaatatta tccgaggttt 94200

tccaaggcag gcattacatg caacacacct ctcattcaat gaccaaaaaa caaataatat 94260

tttttctttt gatattccca ctccaaatga tatggaggaa ctacttctag aattaagaaa 94320

attgatctaa tagtaactaa aaacttgttt tttgattaat aaaataatat aaaccttatt 94380

cctaagagtt ttttggtaag aaattgaaat tatctggcgc agacatgcta atgcaagcac 94440

ttcatgatga aggtgttgag ctaatctttg gctacccagg tggagccgcg cttcatatct 94500

atgatgcaat ttttagacaa gataaaatag atcatatttt agtaaggcat gagcaaggtg 94560

caacccatgc agcagatgga tattcaaggg cgacaggtaa gccaggagtt gtcttagtca 94620

cttctggacc tggtgcaaca aatgctatta caggaatcgc gactgcattt atggattcca 94680

taccaatggt agttatttca gggcaggttg ctagccattt aataggtact gatgcttttc 94740

aagaaactga tatgattggt gtttcaagac caattgttaa gcatagctat acagttttta 94800

ttgctgaaga aatacctaag ataattaaag aagcttttta tgtcgcaact tcaggcagac 94860

Mstggacctgt tgttatagat atcccaaaag acatgacagc tccggataat ctttttgatt 94920

Mkctcgtatcc tgaagaagcc aagataagat catacaatcc tccgattgag ccagaaaaaa 94980

H^tcaaataga tagagcagtc gaagctatat tgatatcaaa aaaaccagtt atatatgctg 95040

gtggtggggc aattgctagt aatgccgaaa aagaattact tgaacttaat gaaattattg 95100

atgctcctgt tacaaatact ttaatgggat tgggtattta ccctgctagt catcatagat 95160

ttcttgggat gttagggatg catggaacat atcaggcaaa tatggcaatg cataatgcag 95220


cit- u LaaUaa L. 


+- rrr* faff rvf~vr* 

tyL laL uyy c 




atyaCayyaL 


uaccaa uaaa 


CCatCdaayc 




ttgcacctaa tgccaaagtg gttcatctag atgttgatca ctcatctgta tcaaaaatta 95340

tagaagcaaa tgtagctgtt tttgggcaag taaaaaattc cttaaaatta ataaaagaaa 95400

ctcttgaaaa aaaattagac tcttacgatt ctttcgctct tcagccttgg cacgatcaga 95460

taaaagaatg gaaatcacta catggtttaa attatgagct ttataaagat gaatctgatg 95520

atcatcccat tttaccccag gctgtagtcc agcatgtcca tgagattaca aatggggaag 95580 

catatgtgac ttccgatgtt ggtcagcatc aaatgtttgc tgctcaatat tatcattttg 95640 

ataagcctag aagatggatc aattctggtg gtctaggaac tatgggtttt ggtttgccag 95700 

cagcaatggg tgtaaaactc gcttttccaa aagatgaggt tgtttgcatt actggtgagg 95760 

gtagtatcca aatgtgcatc caagaattgt ctacatgtct tcaatataat ctcccaataa 95820 

aaataattaa tatcaataac gaagctcttg gtatggttaa acaatggcag gatatgaatt 95880 

atggaggaag gcactctgaa agtacctatc aaaactcgtt accggatttt ataaaactgg 95940 

ctgaatcata tggtcatata ggaattaaaa ttacaaaaaa ttctgattta agtgaaggct 96000 

taaaaaaagc ttttgaaatg aaagataaac ttgtctttgt tgatatttat gtagatcctt 96060 

cagagcatgt ttatccaatg caggttgcaa atggcagcct agaaaatatg tggctatcaa 96120

aggatgaaca aacatgatta aaagaaaact aattttaatt atggaaaata aaccaggagc 96180

tctagtaaga gtagttggac tgtttcatca aagaggctac aacattgaaa cccttcatgt 96240

%gatactgtt aaagactttt ctacttacaa atcgatattg aaaaaaaact tgaaaccaaa 96300

tttgaggata atcaaatatc tagactgacc atagaaacaa tggtttcaga tgaccttatg 96360

aggcagattt tgagacagct caataaatta atagatgtta tagctgtaag caatgaagag 96420

Jcaacctatt taaaaggagt attattagat gaaaatttat tatgacgaag atgcaaacat 96480

agaaattatt aaagggatga atgtctcaat aattggctat ggatctcaag ggaatgctca 96540 

tgcaaataat cttcatgaat ctggtgtaag tgttactgta ggtttaagag aagggtcttc 96600 

ttcatgggca aaggcagaag aagcaggctt aaaagttcaa acagttgctg attcggtaat 96660 

ccaagcagat ttggttatga ttttggcacc agatgaattc caaaaaaata tatatgaaac 96720 

cgaaatcaag ccaaacttaa aaaccagtgc aattcttgca tttgcacatg gctttaatat 96780 

tcattttgaa aaaatagttc ctgaagcaac taacagcgta attatgattg caccaaaagg 96840 

tccaggccat actgttagaa gtacttatac caatggtgga ggcgttccat ctctcatagc 96900

tatatatgaa gatgctttaa gtgatgaaga ttattcagca aaagatgtag ctctatctta 96960

tgcaaaagca aatggcggca cgagggctgg tgttcttgaa acatctttta aagaagaaac 97020

agaaacagat ttatttggcg aacaagcagt tttatgtggc gggcttaccg ctttaattaa 97080

agctgggttt gaaactctag ttgaggcagg ctacagtgaa gagatggcat attttgaatg 97140 

ccttcatgaa acaaaactaa tcacagactt aattcaagaa ggtggcattg ctaatatgca 97200 

ttactcaata tcaaatactg ctgagtatgg tgattatgtg agtggaccca aagtaattac 97260

tagcgatacc aaaaaagcaa tgaaaggaat actagaaaat atacagtcag gaaaattcgc 97320

agatgacttc ttgaatgact gtcgacaaag caatgatggc actggcggac ctgttatgaa 97380 

aagcaataga gaagctacaa aaattcatcc aatagagtcg gtaggggctg agctaaggtc 97440 

taaaatgaag ttcctaaatt cacaaaaatt ggtggataaa gaaattaatt aaaattaatt 97500 

aaaaaaaagg tatcttcttc ggaataactt cgttaagata cgcgtccgca caaagaggcg 97560

gttgttcttt aaaaatattt ggttactcgt gtgggtgttc aaaatacgag aaaaaataat 97620

ttagattttt tataaaaatc aacaaaacat gatattaatt gaagagtttg atcatggctc 97680

agattgaacg ctggcggtag gcttaacaca tgcaagtcgt gcgagaaagt atcttcggat 97740

atgagtagag cggcggacgg gtgagtaacg cgtaggaatc tacctagtag aaggggatag 97800

cccggggaaa cccggattaa taccgtatac ctccttcggg agaaagaagg cctctctttg 97860

aagctttcgc tactagatga gcctgcgtaa gattagcttg ttggtgaggt aaaggctcac 97920

caaggcgacg atctttagct ggtctgagag gacgatcagc cacattggga ctgagacacg 97980 

gcccagactc ctacgggagg cagcagtggg gaatattgga caatgggcgc aagcctgatc 98040 

cagccatacc gcgtgtgtga agaaggcctt cgggttgtaa agcactttaa gcagggagaa 98100 

aaagttataa gttaatacct tataaccctg atgttacctg cagaataagc accggctaat 98160 

tccgtgccag cagccgcggt aatacggaag gtgcaagcgt taatcggaat tactgggcgt 98220 

aaagcgcgcg taggtggttt gttaagttgg atgtgaaagc cctgggctca acctaggaac 98280 

tgcatccaaa actaactcac tagagtacga tagagggagg tagaattcat agtgtagcgg 98340 
tggaatgcgt agatattatg aagaatacca gtggcgaagg cggcctcctg gatctgtact 98400

gacactgagg tgcgaaagcg tgggtagcga acaggattag ataccctggt agtccacgcc 98460 

gtaaacgatg acaactagct gttgggagac tatgtctttc agtggcgcag ctaacgcttt 98520

aagttgtccg cctggggagt acggccgcaa ggctaaaact caaatgaatt gacggggacc 98580 

cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg cgaaaaacct tacctactct 98640 

tgacatactt ggaggctctt gtaatgagag tgtgcctttt ggaaccaaga tacaggtgct 98700 

gcatggctgt cgtcagctcg tgtcgtgaga tgttccgtta agtcggataa cgagcgcaac 98760

ccttaccctt atttgccagc gattcggtcg ggaactataa ggggactgcc ggtgataaac 98820

cggaggaagg tgaggacgac gtcaagtcat catggccctt acgagtaggg ctacacacgt 98880 

gctacaatgg gagatacaga cggacgctaa gccgcgaggt ggtgctaatc ctaaaaagtc 98940 

tttcgtagtc cggattggag tctgcaactc gactccatga agtcggaatc gctagtaatc 99000 

gcggatcagc atgccgcggt gaatacgttc tcgggtcttg tacacaccgc ccgtcacacc 99060 

atggaagtgg attgcaccag aagtagatag tctaacctta gggagggcgt ttaccacggt 99120 

gtgcttcatg actggggtga agtcgtaaca aggtagccgt aggggaacct gtggctggat 99180

cacctcctta acgataaatc gcgttttaaa cgcccacacg agtaatcaaa tattaaaaaa 99240

aagaacattt agatatgtaa aatcattggt atgtaatttt ctagtgtata catttatgta 99300

tacataagat cactgcaatt aaaaagtaac atatgcattt atgtgtatgt taaaaaagta 99360

attaatatat tttattaagt tactctcaaa aatgaagata aaacttcaaa aaaaatatgt 99420

aacctttttt aaggttatat gatcaagtaa aggaagagca caaggcggat gccttggcag 99480 

cataaggcga tgaaggacgt aataacctgc gataagcctc ggggagctgg taaataagct 99540 

tcgatccgag gatttccgaa tgggaaaacc caatacacat aagtgtatta tcttatactg 99600 

aatacatagg tataagaggc aaacctaggg aactgaaaca tctaagtacc tagaggaaaa 99660 

gaaatcaaca gagattccgg tagtagcggc gagcgaaacc ggaccagccc ttaagcttat 99720 

tttagtccag caaaatattc tggaaagttt agccatagta ggtgatagcc ctgtatgcga 99780 
100500

cagccatcct ttaaagaaag cgtaatagct cactagtcga gtcggcctgc gtggaagata 100560

taacggggct aaatcataaa ccgaagctac agatcttaaa tttatttaag atggtagaag 100620

agcgttctgt aagcggttga aggtaagctg agaggcgaac tggacgtatc agaagtgcga 100680



atgttgacat gagtaacgat caaagaggtg aaaaacctct tcgccgaaaa accaagggtt 100740
cctgtccaac gctaatcgag gcagggtgag gcggccccta aggcgagggc gaaagccgta 100800
gtcgatggga aacaggttaa tattcctgta ctttttataa ctgcgatggg gtgacggaga 100860
aggttagact agcacggcga cggttgtccg tgttcaaggt tgtaggctgg tgttctaggt 100920
aaatccggaa cgctaaggct gagaactgat aacgaccact ctacgagtgg gaagtagtcg 100980
ataccatgct tccaggaaaa acctctaagc ttcaggttat aagaaaccgt accctaaacc 101040
gacacaggtg gttaggtcga gtagaccaag gtgtttgaga gaactatggt gaaggaacta 101100
ggcaaaatag caccgtaact tcgggagaag gtgcgccgcg tttggtgatg agacttgctc 101160
tctaagctga acgtggtcga agataccagg tggctgcgac tgtttactaa aaacatagca 101220
ctctgcaaac tcgtaagagg aagtataggg tgtgacgcct gcccggtgcc ggaaggttaa 101280
ttgatggggt tagcttatgc gaagctcttg atcgaagccc cggtaaacgg cggccgtaac 101340
tataacggtc ctaaggtagc gaaattcctt gtcgggtaag ttccgacctg cacgaatggc 101400
gtaacgatgg ccacactgtc tccaccatag actcagtgaa attgaaatcg ctgttaagat 101460
gcagtgtacc cgcagctaga cggaaagacc ccgtgcacct ttactatagg ttcgcactgg 101520
actttgacct tacttgtgta ggataggtgg gagactttga agcagagacg ccagtctttg 101580
tggagtcatc cttgaaatac cacccttgta agattgaagt tctaacctag gtccattatc 101640
tggatcaggg acagtgcgtg ctgggtagtt tgactggggc ggtctcctcc taaagagtaa 101700
cggaggagta cgaaggtatc cttatcacgg tcggacatcg tgaggtaagt ataaaggcag 101760
aaggatgctt gactgcgaga tcgacggatc gagcaggtag gaaactaggt cttagtgatc 101820
cggtggttct gaatggaagg gccatcgctc aacggataaa aggtacgccg gggataacag 101880
gctgataccg cccaagagtt catatcgacg gcggtgtttg gcacctcgat gtcggctcat 101940
cacatcctgg ggctggagca ggtcccaagg gtatggctgt tcgccattta aagtggtacg 102000
cgagctgggt ttagaacgtc gtgagacagt tcggtcccta tctgctgtgg gcgtttggag 102060
atttgaggga agctgattct agtacgagag gaccgaattg gacgaacctc tggtgttccg 102120
gttgtcacgc cagtggcatt gccgggtagc tatgttcgga aaggataacc gctgaaagca 102180
tataagcggg aagcctctcc caagattaaa tctcccagag actttatgtc tcctaaagag 102240
tcgtcataga ctatgacgtt gataggcaag atgtgtaagc gctgcgaggc gttgagctaa 102300
cttgtactaa taactcgtga ggcttgatca tgtaacctta agcaaggttc ataatttgag 102360
taaaacattg tagtgagaat taaaaaataa aaagttacat accagtttgc ctgatgacaa 102420
tagcaacttg gaaccacctg atcccatctc gaactcagaa gtgaaacgag ttaacgccaa 102480
tggtagtgca gggtctccct gtgtgagagt aggaaatcgt caggcttttt tctttaaggc 102540
ttccagttta ctggaagcct ttttttttat ctcaagtata atacccagat gattattgga 102600
ttaacaggag gcattggttc tggcaaatct gccgctgcag acttctttat tgatttaggt 102660
atatcagtct tagatgcaga tcaagttgct aaagaagctt tatctacaaa ttctcctgga 102720
tatactgatt ttatttctca atttggtgaa gtgtatttaa ataataatcg tgaggttgat 102780
aggctgaaat tgcgcgaact tattttttca aatccttcaa aaaaaaagga tcttgagaat 102840
attattcatc ccatagttcg gtctgctatt agtaatttta ttattacatc aacatcacca 102900
tattctattg ttatggtgcc actcattttt gaaacaaatt catataaaaa ttacgataag 102960
attattactg ttgactgtga tttagaactt cagatagtaa gggcctcaag tagagatgct 103020
caaaataaat cgcagattaa gaatattatt aataagcaag cctctagaga ggagaggcta 103080
agtatttctg atgacgtact tatcaataac agcaccctat ctgatctaaa aaaacaagtt 103140
aatgttttac atactaaata tatggagtta ttaaatgagt agttgcccta gatgtgaaaa 103200
acctgtcaaa ctttctactg acaatattta tagacctttc tgctctgaga aatgcaaact 103260
tatagatttt ggtgattggg ctaatgaaga taataaaatt tcaagaccaa ttcaatctga 103320
agatttttac gaagattaaa tttaagaaag tctccattca ccactttcaa tcatcggttt 103380
ggctttttta tacttcattt cttgcgtatc ttgaccgtta gtaattttta caagatcatt 103440
tctgcctaac ttaggttcgt ttcttgtcac agtacttgtt tttactatag gagtggcttg 103500
tttttcattt tgaaagatat cagaattaat ttcctctttt tccaatttta attcttgaga 103560
actattttgt ttgtttatgc tctctagctc agattctgtc gatatttgta atgagaaaag 103620
tattcttata gtttcaacat ctatttccga aagcatagat tcaaacattg aatatgcctc 103680
tcttttgaat tcattttttg gatttttttg agcataagcc cttagaccaa cactatttct 103740
taaatgatct atttctgata aatgctcttt ccaatggaca tcaagtactt gcagcataac 103800
ctgcttctca agaagcaatc tattttcacc aagatcacta aacttttttg aatatttatt 103860
ttttgcttgc agaacaattt cttcggcaat tgtatttggt acaagttttt tattactatt 103920
aattttattg gctatatctg tttctagtcc atagctctct tttaaataat catcgagctg 103980
cctacttttc cattgagact ctattgattc ttcaggtaca tataaattag atattccttt 104040
gaattgctgc tcgatgagtg actcaatggt actgctaata tcttcctctt ctagtaattg 104100
attccttaaa gaatatatag cttgtctttg atcatttgat acatcgtcgt actctaaaag 104160
atttttcctt gcatcaaagt ttctgctctc aattcttttt tgcgcatttt ctattcctct 104220 
tgaaagcatt ttagcttcaa tatgatcatc tcccatgccc agcctttcaa aagtagccct 104280 
tcttccatcc gaaataaaaa gtctcaagag atcgtcttct aaagataaga agaattttga 104340 
ataacctgga tcaccttgtc tgcctgatct cccccttaac tgattatcta ttcttcttga 104400 
ttcatgcctt tctgttccaa gtatatgaag cccaccagat tctataactt ttttattatt 104460 
ttctttccac tctatatcgc tttggtcttc tttttttccg ccaagaacta tatccgtccc 104520 
ccttcctgcc atattcgtag caatagttac cattccaggt ttacctgcat tggcaattat 104580 
ctctgcttct ttttcgtgat gcttggcatt taaaatttga tgaggtattt ttttgttatt 104640 
taaatatgct gatacttctt ctgaagattc aaccgaaact gttccaacaa gaatgggaga 104700 
agattttttt cttaattgtt caatttcttc aattagagct ttatatttcg attctgttgt 104760 
aagaaatact aagtcattaa gatcagctct agccatagga acatttgttg ggatgatgat 104820 
gacatttagg ccatagattt gactaaactc tactgcttca gtatctgctg ttcctgtcat 104880
cccagaaagt tttttaaata atctaaaaaa gttttggaat gtggtggatg ctagtgtttg 104940
agactctctt tggatagcaa cattttcttt gcattccagt gcctggtgaa caccttcact 105000
cattcttctt ccgggcattg ttctacctgt atgctcatca atcaaaagaa cctcaccgtt 105060
cctaaccaaa taatccacat tctttttaaa taagaagctt gctctaagtg ttgcttgaac 105120
aaatttcata atttttaaat tagaaacaga gtaagcccat ctgaggctcc aagccgattc 105180
cagc 105184
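
Each nucleotide line in the listing above carries groups of lowercase residues followed by a running residue count, so a line can be checked against the count of the line before it. The following is a minimal sketch of such a check, assuming whitespace-separated groups and a trailing integer; the function name `check_sequence_line` is hypothetical and not part of the listing.

```python
import re


def check_sequence_line(line: str, previous_count: int) -> int:
    """Validate one listing line and return its running residue count.

    Assumes the line is whitespace-separated residue groups followed by
    the cumulative residue count printed at the end of the line.
    """
    *groups, count_text = line.split()
    for group in groups:
        # Only the four nucleotide letters are accepted; anything else is
        # treated as transcription damage.
        if not re.fullmatch(r"[acgt]+", group):
            raise ValueError(f"non-nucleotide characters in group {group!r}")
    residues = sum(len(group) for group in groups)
    count = int(count_text)
    if count != previous_count + residues:
        raise ValueError(
            f"count {count} does not equal {previous_count} + {residues}"
        )
    return count


# The last two lines of SEQ ID NO: 1 as printed above.
lines = [
    "aaatttcata atttttaaat tagaaacaga gtaagcccat ctgaggctcc aagccgattc 105180",
    "cagc 105184",
]
count = 105120  # running count printed on the preceding line
for line in lines:
    count = check_sequence_line(line, count)
print(count)
```

A check of this kind flags both a stray non-nucleotide character and a count that has been split or misread.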

<210> 2 

<211> 29 

<212> DNA 

<213> PCR primer



<220>

<221> oligo-nucleotide-primer
<222> (1)..(29)



<300>

<301> Beja,O., Aravind,L., Koonin,E.V., Suzuki,M.T., Hadd,A.,
Nguyen,L.P., Jovanovich,S.B., Gates,C.M., Feldman,R.A., Spudich,J.L.,
Spudich,E.N. and DeLong,E.F.

<302> Bacterial rhodopsin: evidence for a new type of phototrophy in
the sea

<303> Science
<304> 289
<305> 5486
<306> 1902-1906
<307> 2000-09-15
<309> - -



<400> 2

accatgggta aattattact gatattagg 29

<210> 3 

<211> 24 

<212> DNA 

<213> PCR primer

<220>
<221> oligo-nucleotide-primer
<222> (1)..(24)



<300>

<301> Beja,O., Aravind,L., Koonin,E.V., Suzuki,M.T., Hadd,A.,
Nguyen,L.P., Jovanovich,S.B., Gates,C.M., Feldman,R.A., Spudich,J.L.,
Spudich,E.N. and DeLong,E.F.

<302> Bacterial rhodopsin: evidence for a new type of phototrophy in
the sea

<303> Science

<304> 289
<305> 5486
<306> 1902-1906
<307> 2000-09-15
<309> - -



<400> 3

agcattagaa gattctttaa cagc 24
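
The 24-residue primer of SEQ ID NO: 3 can be compared with the coding sequences in this listing by taking its reverse complement. The sketch below is an illustration only: it assumes the primer reads `agcattagaagattctttaacagc` and that the coding sequence of SEQ ID NO: 4 ends with the codons `gct gtt aaa gaa tct tct aat gct`, as printed in that record. The helper name `reverse_complement` is hypothetical.

```python
# Map each nucleotide to its Watson-Crick partner.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


primer_seq3 = "agcattagaagattctttaacagc"      # SEQ ID NO: 3, 24 residues
cds_3prime_end = "gctgttaaagaatcttctaatgct"   # last 24 residues of SEQ ID NO: 4

print(reverse_complement(primer_seq3))
print(reverse_complement(primer_seq3) == cds_3prime_end)
```

Under these assumptions the reverse complement of the primer coincides with the 3' end of the coding sequence, which is what would be expected of a reverse amplification primer.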

<210> 4 

<211> 750 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 



<220> 
<221> CDS

<222> (1)..(750)

<223> light-driven proton pump; has the properties of a light-driven
proton pump when expressed with retinal in Escherichia coli. Note
that three additional nucleotide residues were incorporated by PCR
priming relative to the original 31A08 DNA sequence (DNA residues
4-6, ggt), adding a new restriction site for cloning



<300>

<301> Beja,O., Aravind,L., Koonin,E.V., Suzuki,M.T.,
Hadd,A., Nguyen,L.P., Jovanovich,S.B., Gates,C.M.,
Feldman,R.A., Spudich,J.L., Spudich,E.N. and DeLong,E.F.

<302> Bacterial rhodopsin: evidence for a new type of phototrophy in
the sea

<303> Science

<304> 289

<305> 5486

<306> 1902-1906

<307> 2000-09-15

<308> AAG10475

<309> 2000-06-15

<313> (1)..(750)



<400> 4 

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca act aat gtt gct gga tca tta ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa tta tgg gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

att atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250
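
The codon and amino-acid rows of a CDS record can be cross-checked by translating the codon row with the standard genetic code. The sketch below is illustrative and assumes the standard code applies to these sequences; the names `CODON_TABLE` and `translate` are hypothetical, and the output uses one-letter amino-acid codes rather than the three-letter codes printed in the listing.

```python
# Standard genetic code, amino acids listed for codons in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids."""
    residues = cds.replace(" ", "").lower()
    if len(residues) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(
        CODON_TABLE[residues[i:i + 3]] for i in range(0, len(residues), 3)
    )


# First codon row of SEQ ID NO: 4 as printed above.
first_row = "atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca"
print(translate(first_row))  # MGKLLLILGSVIALPT
```

A codon damaged in transcription (for example `get` in place of `gct`) raises a `KeyError` here, which makes the check useful for spotting such errors.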



<210> 5 

<211> 250 

<212> PRT 

<213> Naturally occurring gamma proteobacterium 

<400> 5 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 6 

<211> 747 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220>

<221> CDS

<222> (1)..(747)

<223> Native proteorhodopsin DNA sequence from BAC clone 31A08



<300>

<301> Beja,O., Aravind,L., Koonin,E.V., Suzuki,M.T., Hadd,A.,
Nguyen,L.P., Jovanovich,S.B., Gates,C.M., Feldman,R.A., Spudich,J.L.,
Spudich,E.N. and DeLong,E.F.

<302> Bacterial rhodopsin: evidence for a new type of phototrophy in
the sea

<303> Science
<304> 289
<305> 5486
<306> 1902-1906
<307> 2000-09-15
<309> - -



<313> (1)..(747)



<400> 6 

atg aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca ttt 48
Met Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr Phe
1 5 10 15

gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt tct 96
Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val Ser
20 25 30

ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc ttt 144
Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe Phe
35 40 45
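gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act gta 192
Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr Val
50 55 60

tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg aga 240
Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met Arg
65 70 75 80

ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac att 288
Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr Ile
85 90 95

gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta att 336
Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu Ile
100 105 110

ctt gct gct gca act aat gtt gct gga tca tta ttt aag aaa tta cta 384
Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu Leu
115 120 125

gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca gga 432
Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala Gly
130 135 140

atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg gta 480
Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp Val
145 150 155 160

tac atg att tat gaa tta tgg gct gga gaa gga aaa tct gca tgt aat 528
Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys Asn
165 170 175

act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat att 576
Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr Ile
180 185 190

atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt tac 624
Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly Tyr
195 200 205

ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat aac 672
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr Asn
210 215 220

ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg aat 720
Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp Asn
225 230 235 240

gtt gct gtt aaa gaa tct tct aat gct 747
Val Ala Val Lys Glu Ser Ser Asn Ala
245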
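
The feature note of SEQ ID NO: 4 states that it differs from the native 31A08 sequence by three added residues (ggt at positions 4-6). The sketch below illustrates that relationship on the first residues of both records only; it assumes SEQ ID NO: 4 begins `atg ggt aaa tta tta ctg` and SEQ ID NO: 6 begins `atg aaa tta tta ctg`, and the helper `delete_residues` is hypothetical.

```python
def delete_residues(seq: str, first: int, last: int) -> str:
    """Remove residues first..last (1-based, inclusive) from a sequence."""
    return seq[: first - 1] + seq[last:]


modified_start = "atgggtaaattattactg"  # first 18 residues of SEQ ID NO: 4
native_start = "atgaaattattactg"       # first 15 residues of SEQ ID NO: 6

# Residues 4-6 of the modified sequence are the added glycine codon.
print(modified_start[3:6])
# Deleting them recovers the start of the native sequence.
print(delete_residues(modified_start, 4, 6) == native_start)
```

The same three-residue difference accounts for the lengths stated in the two records, 750 and 747.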



<210> 7 
<211> 249 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 7 

Met Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr Phe
1 5 10 15

Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val Ser
20 25 30

Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe Phe
35 40 45

Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr Val
50 55 60

Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met Arg
65 70 75 80

Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr Ile
85 90 95

Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu Ile
100 105 110

Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu Leu
115 120 125

Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala Gly
130 135 140

Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp Val
145 150 155 160

Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys Asn
165 170 175

Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr Ile
180 185 190

Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly Tyr
195 200 205

Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr Asn
210 215 220

Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp Asn
225 230 235 240

Val Ala Val Lys Glu Ser Ser Asn Ala
245



<210> 8 

<211> 750 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 






<222> (1)..(750)

<223> proteorhodopsin variant from clone EBAC40 



<400> 8 

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gag act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480
Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175



aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

gac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asp Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 9
<211> 250
<212> PRT

<213> Naturally occurring gamma proteobacterium
<400> 9

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asp Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 10 

<211> 750 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 



<220> 

<221> CDS 

<222> (1)..(750) 

<223> proteorhodopsin variant from clone EBAC41



<400> 10 

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gct act gct gct tta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Ala Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gct act aat gtt gct gga tca tta ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

att atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gat ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 11 
<211> 250 
<212> PRT 






<213> Naturally occurring gamma proteobacterium 



<400> 11 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Ala Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 12 
<211> 750 
<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 

<222> (1)..(750)

<223> Proteorhodopsin variant from clone EBAC64 

<400> 12 

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga gga gta tgg att gaa act ggt gat tcg cct act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576
Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

atc ata gtc ttc ggt tgg gca att tat cct ata ggt tat ttc aca ggt 624
Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Ile Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 13 
<211> 250 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 13 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Ile Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 14 

<211> 750 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220>

<221> CDS

<222> (1)..(750)

<223> Proteorhodopsin variant from PCR clone HOTOlm: GenBank# AF349978



<400> 14 
atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gag acc ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480
Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 15

<211> 250

<212> PRT

<213> Naturally occurring gamma proteobacterium



<400> 15 



Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 16 
<211> 753 
<212> DNA 

<213> Naturally occurring gamma proteobacterium
<220>

<221> CDS

<222> (1)..(753)

<223> Proteorhodopsin variant from PCR clone HOT75m1; GenBank #AF349979



<400> 16 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt gct 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Ala
50 55 60
gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gcc gac ctt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 17
<211> 251
<212> PRT

<213> Naturally occurring gamma proteobacterium
<400> 17

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Ala
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 18 

<211> 753 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 



<220> 

<221> CDS 

<222> (1)..(753) 

<223> Proteorhodopsin variant from PCR clone HOT75m3; GenBank #AF349980
<400> 18

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gta ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

ggt tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta cat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg aag 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190

att att gtt att gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 19 
<211> 251 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 

<400> 19 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190

Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250
<210> 20

<211> 753

<212> DNA

<213> Naturally occurring gamma proteobacterium
<220>

<221> CDS

<222> (1)..(753)

<223> Proteorhodopsin variant from PCR clone HOT75m4; GenBank #AF349981



<400> 20 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa gtg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu
100 105 110



att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 21 
<211> 251 
<212> PRT 



<213> Naturally occurring gamma proteobacterium 
<400> 21

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 22 
<211> 753 
<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 

<222> (1)..(753)

<223> Proteorhodopsin variant from PCR clone HOT75m8; GenBank #AF349982



<400> 22 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca aat gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga ttg gct cct gta tgg cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg gtg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gcc gac ctt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 23 

<211> 251 

<212> PRT 

<213> Naturally occurring gamma proteobacterium 

<400> 23 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 24 

<211> 750 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220>

<221> CDS

<222> (1)..(750)

<223> Proteorhodopsin variant from PCR clone MB0m1; GenBank #AF349983



<400> 24 

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gag act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg aac gct tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Asn Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576
Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 25 
<211> 250 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 25 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Asn Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 26 

<211> 750 

<212> DNA 

<213> Naturally occurring gamma proteobacterium

<220>

<221> CDS

<222> (1)..(750)

<223> Proteorhodopsin variant from PCR clone MB0m2



<400> 26 
atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gct act aat gtt gct gct ggc ctg ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480
Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gag ctt tgg ctt gga gaa gga aaa gct gcg tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Leu Gly Glu Gly Lys Ala Ala Cys
165 170 175

aat aca gca agt cct gct gtt cag tca gct tac aac aca atg atg atg 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Met
180 185 190

atc atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gca ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 27
<211> 250
<212> PRT

<213> Naturally occurring gamma proteobacterium
<400> 27

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Leu Gly Glu Gly Lys Ala Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Met
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 28 
<211> 750 
<212> DNA 

<213> Naturally occurring gamma proteobacterium
<220>

<221> CDS

<222> (1)..(750)

<223> Proteorhodopsin variant from PCR clone MB20m2; GenBank #AF349985
<400> 28

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca act aat gtt gct gct ggc ctg ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480
Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 29
<211> 250
<212> PRT

<213> Naturally occurring gamma proteobacterium
<400> 29

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

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Val Ser Gly Leu Val Thr Gly He Ala Phe Trp His Tyr Met Tyr Met 
65 70 75 80 



Arg Gly Val Trp He Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 

85 90 95 



He Asp Trp Leu Leu Thr Val Pro Leu Leu He Cys Glu Phe Tyr Leu 
100 105 HO 



He Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 
115 120 125 



Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 
130 135 140 



Gly He Met Asn Ala Trp Gly Ala Phe Val He Gly Cys Leu Ala Trp 
145 150 155 160 



Val Tyr Met He Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 

165 170 175 



Asn Thr Ala Ser Pro Ala Val Gin Ser Ala Tyr Asn Thr Met Met Tyr 
180 185 190 



He He He Phe Gly Trp Ala He Tyr Pro Val Gly Tyr Phe Thr Gly 
195 200 205 



Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu He Tyr 
210 215 220 



Asn Leu Ala Asp Phe Val Asn Lys He Leu Phe Gly Leu He He Trp 
225 230 235 240 



Asn Val Ala Val Lys Glu Ser Ser Asn Ala 

245 250 






<210> 30
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB20m5; GenBank #AF349986

<400> 30

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gct act aat gtt gct gga tca tta ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

caa att atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gln Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576
Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt ggg tca gct ctt aac tta aac ctt att tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ctt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Leu Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 31
<211> 250
<212> PRT
<213> Naturally occurring gamma proteobacterium

<400> 31

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gln Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Leu Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 32
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB20m12; GenBank #AF349987

<400> 32

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca gct aat gtt gct gga tca tta ttt aag aaa tta 384
Ile Leu Ala Ala Ala Ala Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa tta tgg gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gcc tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

att atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac ttg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 33
<211> 250
<212> PRT
<213> Naturally occurring gamma proteobacterium

<400> 33

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Ala Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 34
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB40m1; GenBank #AF349988

<400> 34

atg ggt aaa tta tta ctg ata ata ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Ile Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tat atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat aca gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

att atc gtc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 35
<211> 250
<212> PRT
<213> Naturally occurring gamma proteobacterium

<400> 35

Met Gly Lys Leu Leu Leu Ile Ile Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 36
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB40m5; GenBank #AF349989

<400> 36

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gag act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480
Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag aat cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Asn Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 37
<211> 250
<212> PRT
<213> Naturally occurring gamma proteobacterium

<400> 37

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Asn Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 38
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB40m12; GenBank #AF34999

<400> 38

atg ggt aaa tta tta cgg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Arg Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tat atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca act aat gtt gct gga tca tta ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

atc atc atc gtt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 39
<211> 250
<212> PRT
<213> Naturally occurring gamma proteobacterium

<400> 39

Met Gly Lys Leu Leu Arg Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 40
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB100m5; GenBank #AF349991

<400> 40

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576
Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 41
<211> 250
<212> PRT
<213> Naturally occurring gamma proteobacterium

<400> 41

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 42
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB100m7; GenBank #AF349992

<400> 42

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gct act aat gtt gcc ggc tca tta ttt aag aaa ctt 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576
Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gct gct gtt aaa gaa tct tct aat gct 750
Asn Ala Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 43
<211> 250
<212> PRT
<213> Naturally occurring gamma proteobacterium

<400> 43

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190

Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Ala Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 44
<211> 750
<212> DNA
<213> Naturally occurring gamma proteobacterium

<220>
<221> CDS
<222> (1)..(750)
<223> Proteorhodopsin variant from pcr clone MB100m9; GenBank #AF349993

<400> 44

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tat atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

ata gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gcc gct gca act aat gtt gct gga tca tta ttt aag aaa tta 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

ctt gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

atc atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac ctt atg ggt gac ggt gga tca gca ctt aac tta aac ctt att tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 45 
<211> 250 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 45 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 46 

<211> 750 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 

<222> (1)..(750)

<223> Proteorhodopsin variant from PCR clone MB100m10; GenBank #AF34999



<400> 46 

atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96
Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

tct ttt tgg tta gtt aca gct gct cta tta gcg tct act gta ttt ttc 144
Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192
Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tat atg 240
Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288
Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384
Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432
Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

gga ata atg gcg gct tgg cct gca ttc atc gtt gga tgt tta gca tgg 480
Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Val Gly Cys Leu Ala Trp
145 150 155 160

gta tat atg att tat gaa cta tgg gct ggt gaa gga aaa tct gca tgt 528
Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

aat act gca agt cct gct gta cag tca gct tac aac aca atg atg tat 576
Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

atc atc atc gtt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624
Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

tac cta atg ggt gac ggt gga tca gct ctt aat cta aac ctt att tat 672
Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720
Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

aat gtt gct gtt aaa gaa tct tct aat gct 750
Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 47 
<211> 250 
<212> PRT

<213> Naturally occurring gamma proteobacterium 



<400> 47 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr
1 5 10 15

Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu
115 120 125

Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140

Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Val Gly Cys Leu Ala Trp
145 150 155 160

Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175

Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190

Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205

Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr
210 215 220

Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp
225 230 235 240

Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 48 

<211> 753 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 

<222> (1)..(753) 

<223> Proteorhodopsin variant from PCR clone PALB1; GenBank #AF349995



<400> 48 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gta ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg cta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

ggt tta gct cct gta tta cct gct ttc att ctt ggt atg gct ggt tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Leu Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta cat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gct tac aat gca atg atg aag 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190

att att gtt att gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 49 
<211> 251 
<212> PRT 

<213> Naturally occurring gamma proteobacterium
<400> 49

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Leu Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190

Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 50 
<211> 753 
<212> DNA 
<213> Naturally occurring gamma proteobacterium



<220> 

<221> CDS 

<222> (1)..(753) 

<223> Proteorhodopsin variant from PCR clone PALB2; GenBank #AF349996



<400> 50 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct gag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Glu Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gat act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 51 
<211> 251 
<212> PRT 



<213> Naturally occurring gamma proteobacterium 



<400> 51 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Glu Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 52 

<211> 753 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 

<222> (1)..(753) 

<223> Proteorhodopsin variant from PCR clone PALB5; GenBank #AF349997



<400> 52 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gcc ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca aat gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tgg cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac cta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 53
<211> 251 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 53 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140



Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 54
<211> 753 
<212> DNA 

<213> Naturally occurring gamma proteobacterium 



<220> 

<221> CDS 

<222> (1)..(753) 

<223> Proteorhodopsin variant from PCR clone PalB7; GenBank #AF349999

<400> 54

atg ggt aaa tta tta ctg ata tta ggt agt gct att gcg ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt acg gct ggt atg tta gcg gca act gta ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gcc gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca ttg gta atg tta ggt gct gga tct gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Ser Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctc ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 55 
<211> 251 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 55 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Ser Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 56

<211> 753 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 

<222> (1)..(753)

<223> Proteorhodopsin variant from PCR clone PalB6; GenBank #AF349998



<400> 56

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca aat gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tgg cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg gtg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac cta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 57 

<211> 251 

<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 57

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 58 

<211> 753 

<212> DNA 

<213> Naturally occurring gamma proteobacteria 
<220> 

<221> CDS 

<222> (1)..(753) 

<223> Proteorhodopsin variant from PCR clone PalB8; GenBank #AF350000



<400> 58 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ctt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 59 
<211> 251 
<212> PRT 

<213> Naturally occurring gamma proteobacteria 
<400> 59 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 60 

<211> 753 

<212> DNA 

<213> Naturally occurring gamma proteobacteria 
<220>

<221> CDS

<222> (1)..(753)

<223> Proteorhodopsin variant from PCR clone PalE1; GenBank #AF350001



<400> 60 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa gtg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta tat atg ggt gaa ggc aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac cct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Pro Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 61 
<211> 251 
<212> PRT 

<213> Naturally occurring gamma proteobacteria 
<400> 61 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Pro Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 62 

<211> 753 

<212> DNA 

<213> Naturally occurring gamma proteobacterium 



<220> 

<221> CDS 

<222> (1)..(753)

<223> Proteorhodopsin variant from PCR clone PalE6; GenBank# AF350002



<400> 62 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gta ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

ggt tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

tta tac atg att tat gag cta cat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg aag 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190

att att gtt att gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 63 
<211> 251
<212> PRT 

<213> Naturally occurring gamma proteobacterium 

<400> 63 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190

Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250

<210> 64 
<211> 753 
<212> DNA 

<213> Naturally occurring gamma proteobacterium 
<220> 

<221> CDS 

<222> (1)..(753) 

<223> Proteorhodopsin variant from PCR clone PalE7; GenBank# AF350003
<400> 64 

atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48
Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96
Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144
Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192
Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240
Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

aga ggt gtt tgg ata gat act ggt gat acc cca aca gta ttc aga tat 288
Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336
Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384
Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432
Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480
Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

cta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528
Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576
Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624
Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

tac cta atg ggt ggc gaa ggc gta tac gct tca aac tta aac ctt ata 672
Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720
Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

tgg aat gtt gct gtt aaa gaa tct tct aat gct 753
Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250



<210> 65 
<211> 251 
<212> PRT 

<213> Naturally occurring gamma proteobacterium 
<400> 65 

Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser
1 5 10 15

Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30

Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45

Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60

Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met
65 70 75 80

Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95

Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110

Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125

Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala
130 135 140

Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp
145 150 155 160

Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175

Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190

Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205

Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile
210 215 220

Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile
225 230 235 240

Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 250